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NOVEL PARASITE ASTACIN METALLOENDOPEPTIDASE PROTEINS 

Cross-Ref erence to Related Applications 
This application is a continuation-in-part of copending 
U.S. Application Serial No. 08/463,994, filed June 5, 1995, 
5 which is a continuation of U.S. Application Serial No. 
08/249,552, filed May 26, 1994, now abandoned. 

Field of the Invention 
The present invention relates to novel parasite protease 

genes, proteins encoded by such genes, antibodies raised 
10 against such proteins, and protease inhibitors produced using 

such proteins. Particular proteases of the present invention 

include astacin metalloendopeptidases and cysteine proteases. 

The present invention also includes therapeutic compositions 

comprising such nucleic acid molecules, proteins, antibodies 
15 and inhibitors, as well as their use to protect animals from 

disease caused by parasites, such as by tissue-migrating 

helminths such as hear two rm. 

Background of the Invention 

Parasite infections in animals, including humans, are 

20 typically treated by chemical drugs, because there are 

essentially no efficacious vaccines available. One 
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disadvantage with chemical drugs is that they must be 
administered often. For example, dogs susceptible to 
heartworm are typically treated monthly to maintain protective 
drug levels. Repeated administration of drugs to treat 
5 parasite infections, however, often leads to the development 
of resistant strains that no longer respond to treatment. 
Furthermore, many of the chemical drugs are harmful to the 
animals being treated, and as larger doses become required due 
to the build up of resistance, the side effects become even 
10 greater. 

It is particularly difficult to develop vaccines against 
parasite infections both because of the complexity of the 
parasite's life cycle and because, while administration of 
parasites or parasite antigens can lead to the production of 
15 a significant antibody response, the immune response is 
typically not sufficient to protect the animal against 
infection. 

As for most parasites, the life cycle of Dirofilaria 
immitis, the helminth that causes heartworm, includes a 
20 variety of life forms, each of which presents different 
targets, and challenges, for immunization. Adult forms of the 
parasite are quite large and preferentially inhabit the heart 
and pulmonary arteries of an animal . Males worms are 
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typically about 12 centimeters (cm) to about 2 0 cm long and 
about 0.7 millimeters (mm) to about 0 • 9 mm wide; female worms 
are about 25 cm to about 31 cm long and about 1.0 to about 1.3 
mm wide. Sexually mature adults, after mating, produce 
5 microfilariae which are only about 300 micrometers (jim) long 
and about 7 ]m wide. The microfilariae traverse capillary 
beds and circulate in the vascular system of dogs in 
concentrations of about 10^ to about 10 ^ microfilariae per 
milliliter (ml) of blood. One method of demonstrating 

10 infection in dogs is to detect the circulating microfilariae. 

If dogs are maintained in an insect-free environment, the 
life cycle of the parasite cannot progress. However, when 
microfilariae are ingested by female mosquitos during blood 
feeding on an infected dog, subsequent development of the 

15 microfilariae into larvae occurs in the mosquito. The 
microfilariae go through two larval stages (LI and L2) and 
finally become mature third stage larvae (L3) of about 1 . 1 mm 
length, which can then be transmitted back to a dog through 
the bite of the mosquito. It is this L3 stage, therefore, 

20 that accounts for the initial infection. As early as three 
days after infection, the L3 molt to the fourth larval (L4) 
stage, and subsequently to the fifth stage, or immature 
adults. The immature adults migrate to the heart and 
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pulmonary arteries, where they mature and reproduce, thus 
producing the microfilariae in the blood. "Occult" infection 
with heartworm in dogs is defined as an infection in which no 
microfilariae can be detected, but the existence of adult 
5 heartworms can be determined through thoracic examination. 

Both the molting process and tissue migration are likely 
to involve the action of one or more enzymes, including 
proteases. Although protease activity has been identified in 
a number of parasites (including in larval excretory-secretory 
10 products) as well as in mammals, there apparently has been no 
identification of an astacin metalloendopeptidase gene in any 
parasite or of a cysteine protease gene in any filariid. 

Astacin metalloendopeptidases, so-called because of their 
similarity to the metalloendopeptidase astacin found in 
15 crayfish, are a relatively newly recognized class of 
metalloproteases that have been found in humans, mice and rats 
as well as apparently in Drosophila fruit flies, Xenopus frogs 
and sea urchins; see, for example, Gomis-Ruth et al., 1993, J. 
Mol, Biol. 229, 945-968; Jiang et al., 1992, FEES Letters 312, 
2 0 110-114; and Dumermuth et al . , 1991, J. Biol . Chem. 266, 
21381-21385. Human intestinal and mouse kidney brush border 
metalloendopeptidases share about 82 percent homology in the 
amino-terminal 198 amino acids. Both share about 30 percent 
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homology with astacin and with the amino-terminal domain of 
human bone morphogenetic protein 1. Members of the astacin 
family share an extended zinc-binding domain motif, the 
consensus sequence of which was identified by Dumermuth et 
al., ibld.r as being HEXXHXXGFXHE, wherein H is histidine, E 
is glutamic acid, G is glycine, F is phenylalanine and X can 
be any amino acid, Gomis-Ruth et al., ihid. , define the zinc- 
binding domain motif as His-Glu-Uaa-Xaa-His-Xaa-Uaa-Gly-Uaa- 
Xaa-His, wherein Uaa is a bulky apolar residue-containing 
amino acid. Jiang et al., ibid. ^ disclose not only the 
extended zinc-binding domain motif, which they represent as 
HEIGHAIGFXHE (underlined letters being strictly conserved) but 
also two other sequences conserved between astacin 
metalloendopeptidases, including RXDRD spanning amino acids 
from about 15 through about 19 and YDYXSIMHY spanning amino 
acids from about 50 through about 58, assuming that the first 
histidine in the extended zinc-binding domain motif is amino 
acid position 1. The three histidines at positions 1, 5 and 
11 appear to be responsible for zinc binding as is the 
tyrosine at position 58. The glutamic acid at position 2 is 
responsible for catalysis. Other key amino acids include the 
glycine at position 8 which is involved in secondary structure 
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and the glutamic acid at position 12 which forms a salt bridge 
with the amino-terminus of the mature enzyme. 

Consensus sequences, particularly around the active 
sites, have also been identified for serine and cysteine 
proteases; see, for example, Sakanari et al . , 1989, Proc. 
Natl. Acad. Sci. USA 86, 4863-4867. Although cysteine 
protease genes have been isolated from several mammalian 
sources and from the nematodes Haemonchus contortus (e.g., 
Pratt et al., 1992, Moi, Biochem. Parasitol . 51, 209-218) and 
Caenorhabditis elegans (Ray et al., 1992, Mol. Biochem. 
Parasitol. 51, 239-250), the cloning of such genes does not 
necessarily predict that the cloning of novel cysteine 
protease genes will be straight-forward, particularly since 
the sequences shared by different cysteine proteases are such 
that probes and primers based on the consensus sequences are 
highly degenerative. 

Heartworm not only is a major problem in dogs, which 
typically are unable to develop immunity after infection 
(i.e., dogs can become reinfected even after being cured by 
chemotherapy) , but is also becoming increasingly widespread in 
other companion animals, such as cats and ferrets. Heartworm 
infections have also been reported in humans. Other parasite 
infections are also widespread, and all require better 
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treatment, including preventative vaccine programs and/or 
targeted drug therapies. 

Summary of the Invention 
One embodiment of the present invention is an isolated 
parasite nucleic acid molecule capable of hybridizing, under 
stringent conditions, with a Dirofilaria immitis astacin 
metalloendopeptidase gene. Such a nucleic acid molecule, 
namely a parasite astacin metalloendopeptidase nucleic acid 
molecule, can include a regulatory region of a parasite 
astacin metalloendopeptidase gene and/or encode at least a 
portion of a parasite astacin metalloendopeptidase protein. 
A preferred nucleic acid molecule of the present invention 
includes at least a portion of SEQ ID N0:1, of SEQ ID N0:2, 
SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32 and/or SEQ ID NO: 33, 
or the complement thereof. The present invention also 
includes recombinant molecules and recombinant cells that 
include parasite astacin metalloendopeptidase nucleic acid 
molecules of the present invention. Also included are methods 
to produce such nucleic acids, recombinant molecules and 
recombinant cells of the present invention. 

Another embodiment of the present invention is an 
isolated protein that includes a parasite astacin 
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metalloendopeptidase protein or a mimetope of such a protein. 
A parasite astacin metalloendopeptidase protein of the present 
invention preferably has astacin metalloendopeptidase activity 
and/or comprises a protein that, when administered to an 
animal in an effective manner, is capable of eliciting an 
immune response against a natural parasite astacin 
metalloendopeptidase protein. The present invention also 
includes inhibitors of astacin metalloendopeptidase activity 
as well as antibodies that recognize (i.e., selectively bind 
to) a parasite astacin metalloendopeptidase protein and/or 
mimetope thereof of the present invention. Also included are 
methods to produce such proteins, inhibitors and antibodies of 
the present invention. 

Yet another embodiment of the present invention is an 
isolated filariid nematode nucleic acid molecule capable of 
hybridizing, under stringent conditions, with a D. immitis 
cysteine protease gene. Such a nucleic acid molecule, namely 
a filariid cysteine protease nucleic acid molecule, can 
include a regulatory region of a filariid cysteine protease 
gene and/or encode at least a portion of a filariid cysteine 
protease protein. A preferred nucleic acid molecule of the 
present invention includes at least a portion of SEQ ID NO: 12. 
The present invention also includes recombinant molecules and 
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recombinant cells that include filariid cysteine protease 
nucleic acid molecules of the present invention. Also 
included are methods to produce such nucleic acids, 
recombinant molecules and recombinant cells of the present 
invention. 

Another embodiment of the present invention is an 
isolated protein that includes a filariid cysteine protease 
protein or a mimetope of such a protein. A filariid cysteine 
protease protein of the present invention preferably has 
cysteine protease activity and/or comprises a protein that, 
when administered to an animal in an effective manner, is 
capable of eliciting an immune response against a natural 
filariid cysteine protease protein. The present invention 
also includes inhibitors of cysteine protease activity as well 
as antibodies that recognize (i.e., selectively bind to) a 
filariid cysteine protease protein and/or mimetope thereof of 
the present invention. Also included are methods to produce 
such proteins, inhibitors and antibodies of the present 
invention. 

Yet another embodiment of the present invention is a 
therapeutic composition capable of protecting an animal from 
disease caused by a parasite. Such a therapeutic composition 
includes at least one of the following protective compounds: 
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an isolated parasite astacin metalloendopeptidase protein or 
a mimetope thereof; an isolated parasite nucleic acid molecule 
capable of hybridizing under stringent conditions with a D. 
immitis astacin metalloendopeptidase gene; an anti-parasite 
astacin metalloendopeptidase antibody; an inhibitor of astacin 
metalloendopeptidase activity identified by its ability to 
inhibit parasite astacin metalloendopeptidase activity; an 
isolated filariid nematode cysteine protease protein or a 
mimetope thereof; an isolated filariid nematode nucleic acid 
molecule capable of hybridizing under stringent conditions 
with a D. immitis cysteine protease gene; an anti-f ilariid 
nematode cysteine protease antibody; and an inhibitor of 
cysteine protease activity identified by its ability to 
inhibit filariid nematode cysteine protease activity. Also 
included is a method to protect an animal from disease caused 
by a parasite that includes administering to the animal in an 
effective manner a therapeutic composition of the present 
invention, A preferred therapeutic composition of the present 
invention is a composition capable of protecting an animal 
from heartworm. 

The present invention also includes a method to identify 
a compound capable of inhibiting astacin metalloendopeptidase 
activity of a parasite. Such a method includes (a) contacting 
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an isolated parasite astacin metalloendopeptidase protein with 
a putative inhibitory compound under conditions in which, in 
the absence of the compound, the astacin metalloendopeptidase 
protein has astacin metalloendopeptidase activity; and (b) 
determining if the putative inhibitory compound inhibits 
astacin metalloendopeptidase activity. Also included is a 
test kit to identify a compound capable of inhibiting astacin 
metalloendopeptidase activity that includes an isolated 
parasite astacin metalloendopeptidase protein having astacin 
metalloendopeptidase activity and a means for determining the 
extent of inhibition of astacin metalloendopeptidase activity 
in the presence of a putative inhibitory compound. 

Also included in the present invention is a method to 
identify a compound capable of inhibiting cysteine protease 
activity of a parasite. Such a method includes (a) contacting 
an isolated filariid cysteine protease protein with a putative 
inhibitory compound under conditions in which, in the absence 
of the compound, the filariid cysteine protease protein has 
cysteine protease activity; and (b) determining if the 
putative inhibitory compound inhibits cysteine protease 
activity. Also included is a test kit to identify a compound 
capable of inhibiting cysteine protease activity of a parasite 
that includes an isolated filariid cysteine protease protein 
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having cysteine protease activity and a means for determining 
the extent of inhibition of cysteine protease activity in the 
presence of a putative inhibitory compound. 

Detailed Description of the Invention 
The present invention includes the discovery that 
parasites such as D. immitis express astacin 
metalloendopeptidases. As such, the present invention 
includes nucleic acid molecules encoding such proteins as well 
as the proteins themselves. To the inventors^ knowledge, 
astacin metalloendopeptidases have been found previously only 
in humans, mice, rats, crayfish, Drosophila, Xenopus, sea 
urchins, chorioallontoic membranes of quail eggs, and medaka 
fish (Oryzias latipes) • The present invention also includes 
the first identification and isolation of nucleic acid 
molecules encoding f ilariid nematode cysteine proteases as 
well as the cysteine proteases themselves, after a difficult 
and time-consuming search by the inventors for such nucleic 
acid molecules. Isolated nucleic acid molecules and proteins 
of the present invention, including homologues of such nucleic 
acid molecules and proteins, are useful both in protecting 
animals from parasite infections and in other applications, 
including those disclosed below. 
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One embodiment of the present invention is an isolated 
parasite (or parasitic) astacin metalloendopeptidase protein 
or a mimetope thereof (i.e., a mimetope of a parasite astacin 
metalloendopeptidase protein) . According to the present 
invention, an isolated, or biologically pure, protein, is a 
protein that has been removed from its natural milieu. As 
such, "isolated" and "biologically pure" do not necessarily 
reflect the extent to which the protein has been purified. An 
isolated parasite astacin metalloendopeptidase protein can be 
obtained from its natural source. Such an isolated protein 
can also be produced using recombinant DNA technology or 
chemical synthesis. 

As used herein, an isolated parasite astacin 
metalloendopeptidase protein can be a full-length protein or 
any homologue of such a protein, such as a protein in which 
amino acids have been deleted (e.g., a truncated version of 
the protein, such as a peptide) , inserted, inverted, 
substituted and/or derivatized (e.g., by glycosylation, 
phosphorylation, acetylation, myristylation, prenylation, 
palmitation, amidation and/or addition of glycosylphosphatidyl 
inositol) such that the homologue has astacin 
metalloendopeptidase activity and/or is capable of eliciting 
an immune response against a natural D. immitis astacin 
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metalloendopeptidase protein. As used herein^r an astacin 
metalloendopeptidase protein, the full-length version of which 
is a protein that includes an extended zinc-binding domain, 
has characteristics similar to astacin and other members of 
5 the "astacin family of metalloendopeptidases" ; see, for 
example, Dumermuth et al., ijbid. A protein having astacin 
metalloendopeptidase activity is a protein that can cleave 
proteins in a manner similar to the zinc-dependent protease 
astacin. Astacin activity is inhibited by metal chelators 

10 such as ethylene diamine tetraacetic acid (EDTA) and 1,10- 
phenanthroline but not by phosphoramidon, an inhibitor of 
several other metalloproteases including thermolysin and 
neutral endopeptidases . Tissue inhibitors of 

metalloproteinases (TIMPl and TIMP2) , which are the best 

15 characterized protein inhibitors of zinc endopeptidases, do 
not demonstrate inhibitory activity with astacin (Stocker and 
Zwillig, 1995, Methods of Enzymology, vol. 248) . Astacin 
activity can be detected by those skilled in the art (see, for 
example, Dumermuth et al . , ibid.). A protein capable of 

20 eliciting an immune response against a natural protein is a 
protein that includes at least one epitope capable of 
eliciting an immune response (humoral and/or cellular) against 
that natural protein (such as a D. immitis astacin 
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metalloendopeptidase or a D. immitis cysteine protease) when 
the protein (e.g., the natural protein or a homologue thereof) 
is administered to an animal as an immunogen using techniques 
known to those skilled in the art. The ability of a protein 
to effect an immune response can be measured using techniques 
known to those skilled in the art. Examples of methods to 
measure an immune response (e.g., antibody detection, 
resistance to infection) are disclosed herein. 

A parasite astacin metalloendopeptidase protein of the 
present invention, including a homologue of the full-length 
protein, has the further characteristic of being encoded by a 
nucleic acid molecule that is capable of hybridizing, under 
stringent conditions, with (i.e., to) a D. immitis astacin 
metalloendopeptidase gene. A preferred astacin 

metalloendopeptidase protein of the present invention is 
encoded by a nucleic acid molecule capable of hybridizing, 
under stringent conditions, with at least a portion of the 
complement of the nucleic acid sequence disclosed in SEQ ID 
N0:1, SEQ ID N0:2, SEQ ID NO:29, SEQ ID NO: 30, SEQ ID NO:32 
and/or SEQ ID NO: 33. As used herein, the phrase "at least a 
portion of" an entity refers to an amount of the entity that 
is at least sufficient to have the functional aspects of that 
entity. For example, at least a portion of a nucleic acid 
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sequence^ as used herein, is an amount of a nucleic acid 
sequence capable of forming a stable hybrid under stringent 
hybridization conditions. As used herein, stringent 

hybridization conditions refer to standard hybridization 
5 conditions under which nucleic acid molecules (or sequences) , 
including oligonucleotides, are used to identify similar 
sequences. Such standard conditions are disclosed, for 
example, in Sambrook et al.. Molecular Cloning: A Laboratory 
Manual^ Cold Spring Harbor Labs Press, 1989. 

10 Stringent hybridization conditions typically permit 

isolation of nucleic acid molecules having at least about 70% 
nucleic acid sequence identity with the nucleic acid molecule 
being used as a probe in the hybridization reaction. 
Formulae to calculate the appropriate hybridization and wash 

15 conditions to achieve hybridization permitting 30% or less 
mis-match between two nucleic acid molecules are disclosed, 
for example, in Meinkoth et al, 1984, Anal. Biochem 138^ 267- 
284; Meinkoth et al, ijbid, is incorporated by reference herein 
in its entirety. An example of such conditions includes, but 

20 is not limited to, the following: Oligonucleotide probes of 
about 18-25 nucleotides in length with T^'s ranging from about 
SOX to about for example, can be hybridized to nucleic 

acid molecules typically immobilized on a filter (e.g., 
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nitrocellulose filter) in a solution containing 2X SSPE, 1% 
Sarkosyl, 5X Denhardts and 0.1 mg/ml denatured salmon sperm 
DNA at a temperature as calculated using the formulae of 
Meinkoth et al • , ijbid, for about 2 to about 12 hours. The 
filters are then washed 3 times in a wash solution containing 
2X SSPE, 1% Sarkosyl at about 55X for about 15 minutes each. 
The filters can be further washed in a wash solution 
containing 2X SSPE, 1% Sarkosyl at about 55°C for about 15 
minutes per wash. Further examples of such conditions are 
provided in the Examples section. 

It should be noted that the extent of homology required 
to form a stable hybrid can vary depending on whether the 
homologous sequences are interspersed throughout the nucleic 
acid molecules or are clustered (i.e., localized) in distinct 
regions on the nucleic acid molecules. Also in accordance 
with present invention, at least a portion of a astacin 
metalloendopeptidase protein is a protein of sufficient size 
to have astacin metalloendopeptidase activity and/ or to be 
able to elicit an immune response against a natural parasite 
astacin metalloendopeptidase. 

SEQ ID N0:1, SEQ ID N0:2, SEQ ID NO:29, SEQ ID NO:30, SEQ 
ID NO: 32 and SEQ ID NO: 33 represent the nucleotide sequences 
of six cDNA (complementary DNA) nucleic acid molecules denoted 

17 
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nDiMPAli299. nDiMPA22i26/ L3 nDiMPA32292. L3 nDiMPASsoTe, adult 
nDiMPA32032/ and adult nDiMPA32028^ respectively, the production 
of which is disclosed in the Examples. It should be noted 
that since nucleic acid sequencing technology is not entirely 
error-free, SEQ ID N0:1, SEQ ID N0:2, SEQ ID NO:29, SEQ ID 
NO: 30, SEQ ID NO: 32 and SEQ ID NO: 33 represent, at best, 
apparent nucleic acid sequences of the respective nucleic acid 
molecules. As will be discussed in greater detail below, 
nucleic acid molecules nDiMPAl^sgg and nDiMPA22i26 apparently 
comprise overlapping open reading frames, as deduced from SEQ 
ID N0:1 and SEQ ID NO: 2. Each of the nucleic acid molecules 
L3 nDiMPA32292 and adult nDiMPA32032 apparently comprises a 
single open reading frame as deduced from SEQ ID NO: 29 and SEQ 
ID NO: 32, denoted L3 nDiMPA32076 (SEQ ID NO: 30) and adult 
nDiMPA32028 (SEQ ID NO:33), respectively. The deduced amino 
acid sequences encoded by L3 nDiMPA32076 and adult nDiMPA32028 
are disclosed, respectively, in SEQ ID NO: 31 and SEQ ID NO: 34. 
Nucleic acid molecule nDlMPAl^ggg apparently comprises three 
open reading frames, referred to herein as PDIMPAIorfi/ 
PDiMPAloRF2 and PDIMPAIorfs/ the deduced amino acid sequences of 
which are disclosed, respectively, in SEQ ID NO: 3, SEQ ID NO: 4 
and SEQ ID NO: 5. Nucleic acid molecule nDiMPA22i26 apparently 
comprises five open reading frames, referred to herein as 
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PDiMPA2oRnA PDiMPA2oRF2/ PDiMPA2oRF3^ PDiMPA2oRF4 and PDiMPA2oRF5/ 
the deduced amino acid sequences of which are disclosed, 
respectively, in SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID 
NO: 9 and SEQ ID NO: 10. That the open reading frames on both 
5 nucleic acid molecules are overlapping raises the possibility 
that translation of a functional astacin metalloendopeptidase 
in vivo may involve f rameshif ting. Both SEQ ID N0:1 and SEQ 
ID NO: 2 contain nucleic acid sequences, including stem-loop 
structures, that, for frameshift viral gene expression, have 

10 been implicated in ribosome slowing and, hence, frameshift 
translation. The presence of stem loop structures in the mKNA 
could have caused the reverse transcriptase to stutter or 
misread the mRNA during the Dirofilaria cDNA library 
construction. This lack of faithful reproduction of the cDNA 

15 from the mRNA template could account for the base pairs 
missing in the original cDNA clones obtained from the library 
having the nucleic acid sequences SEQ ID N0:1 and SEQ ID NO: 2. 
Alternatively, nucleic acid molecules nDiMPAl^ggg and nDiMPA22i26 
may also be the result of alternative splicing patterns. 

2 0 L3 nDiMPA32292 apparently comprises a single open reading 

frame, referred to herein as L3 nDlMPASsoTs (SEQ ID NO:30), 
which encodes a protein, namely PDiMPASggs/ the deduced amino 
acid sequence of which is disclosed in SEQ ID NO: 31, Adult 
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nDiMPA32032 also apparently comprises a single open reading 
frame, referred to herein as adult nDiMPA32028 (SEQ ID NO:33), 
which encodes a protein, namely adult PDiMPA3g76, the deduced 
amino acid sequence of which is disclosed in SEQ ID NO: 34. 

SEQ ID NO: 11 represents a composite amino acid sequence 
derived from the five open reading frames encoded by 
nDiMPA22i26- such, SEQ ID NO: 11 is an example of a 

combination of disclosed open reading frames, in this case a 
combination of PDiMPA2oRPi, PDiMPA2oRF2/ PDiMPA2oRF3/ PdiMPA2oRF4 
and PDiMPA2oRF5* The astacin domain of SEQ ID NO: 11 has about 
29 percent amino acid sequence homology (i.e., identity within 
comparable regions) with the amino acid sequence of crayfish 
astacin. As used herein, an astacin domain is an amino acid 
sequence of about 200 amino acids that shares significant 
homology with crayfish astacin, which is a 202-amino acid 
protein. The astacin domain of SEQ ID NO: 11 spans from about 
amino acid positions 98 through 299. The astacin domain of 
SEQ ID NO: 11 also shares about 30 percent, 31 percent, 33 
percent and 33 percent homology at the amino acid level with 
the astacin domains of, respectively, human bone morphogenetic 
protein 1, mouse kidney brush border metalloendopeptidase, 
human intestinal brush border metalloendopeptidases and 



20 



2618-21-1-C1 

Xenopus laevis embryonic protein UVS,2 (using sequences 
provided in Dumermuth et al . , ibid.). 

SEQ ID NO: 31 represents the deduced amino acid sequence 
of the single open reading frame of L3 nDiMPA32292^ which is 
represented herein as nucleic acid molecule L3 nDiMPA32076 (SEQ 
ID NO:30). The astacin domain of SEQ ID N0:31 spans amino 
acid positions from about 122 through 32 6. The astacin domain 
of SEQ ID NO: 31 shares about 27.3 percent, 31.7 percent, and 
34.1 percent homology at the amino acid level with the astacin 
domains of, respectively, crayfish astacin, quail astacin and 
the C. elegans R151.5 gene product, (Genbank accession number 
U00036) . SEQ ID N0:31 shows about 81.7% homology with the 
composite amino acid sequence derived from the five open 
reading frames encoded by nDiMPA22i26 (SEQ ID N0:11). 

SEQ ID NO: 34 represents the deduced amino acid sequence 
of the single open reading frame of adult nDiMPA32032/ which is 
represented herein as nucleic acid molecule adult nDiMPA32028 
(SEQ ID NO: 33) . The astacin domain of SEQ ID NO: 34 spans from 
about amino acid positions 122 through 326. The astacin 
domain of SEQ ID NO: 34 shares about 2 6.3 percent, 31.2 
percent, and 34.6 percent homology at the amino acid level 
with the astacin domains of, respectively, crayfish astacin, 
quail astacin and the C. elegans R151.5 gene product, (Genbank 

21 
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accession number U00036) . SEQ ID NO:34 shows about 81.3% 
homology with the composite amino acid sequence derived from 
the five open reading frames encoded by nDiMPA22a26 (SEQ ID 
NO: 11) • 

The amino acid sequences presented as SEQ ID NO: 31 (L3 
PDiMPA3692) and SEQ ID NO: 34 (adult PD IMP A3 57 J contain three 
regions of homology which are conserved within about a 61 
amino acid region of all known astacins. In L3 PDiMPA3692 and 
adult PDiMPA3676^ these three regions span about a 60 amino 
acid sequence corresponding to amino acid positions from about 
214 through about 273 of L3 PDiMPA3692/ and to amino acid 
positions from about 198 through about 257 of adult PDiMPA3676 
(as numbered in SEQ ID NO: 31 and SEQ ID NO: 34, respectively) . 
The first region of homology includes the zinc binding domain, 
which spans positions from about 214 through about 224 of SEQ 
ID NO: 31 and positions from about 198 through about 208 of SEQ 
ID NO: 34, This first region includes three histidines which 
are present in all astacins for zinc binding (imidazole zinc 
ligands) at positions 214, 218 and 224 of SEQ ID NO: 31 and at 
positions 198, 202 and 208 of SEQ ID NO: 34, and a glutamate at 
position 215 of SEQ ID NO: 31 and at position 199 of SEQ ID 
NO: 34, which is assumed to be catalytically important in all 
astacins. In addition, this first region includes a glycine 
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which is important for secondary structure of the protein at 
position 221 of SEQ ID NO: 31 and at position 205 of SEQ ID 
NO: 34, and a glutamate which forms a salt bridge with the 
amino terminus of the mature astacin protein at position 225 
of SEQ ID NO: 31 and at position 209 of SEQ ID NO: 34. 

The second region found in L3 PDiMPA3592 and adult 
PDiMPA3676 that is conserved in all known astacins spans amino 
acid positions 228 through 232 of SEQ ID NO: 31 and positions 
212 through 216 of SEQ ID NO: 34, This second region is a 
hydrophilic region common to all astacins. 

The third region found in L3 PDiMPASggs and adult 
PDiMPA3 676 that is conserved in all known astacins spans amino 
acid positions 265 through 273 of SEQ ID NO: 31 and positions 
249 through 257 of SEQ ID NO: 34, and contains a portion of the 
zinc binding domain. In particular, the hydroxyl oxygen of 
the tyrosine at position 273 of SEQ ID NO: 31 and position 257 
of SEQ ID NO: 34 is the fourth amino acid zinc ligand. It has 
been proposed that the catalytically active zinc ion of 
astacins is penta-coordinated with a water molecule serving as 
the fifth zinc ligand (Stocker et al., 1993, Eur. J. Biochem.) 
In many known astacins, this tyrosine is typically at position 
61 from the first amino acid of the zinc binding domain (i.e., 
61 amino acids from the first histidine in the first region) . 



2618-21-1-C1 

In L3 PDiMPA3692 and adult PDiMPASgvs, this tyrosine is at 
position 60 from the first amino acid of the zinc binding 
domain (i.e., 60 amino acids from the first histidine in the 
first region at position 214 of SEQ ID NO: 31 and position 198 
5 of SEQ ID NO:34) . 

A preferred astacin metalloendopeptidase protein of the 
present invention includes an amino acid sequence having at 
least about 35 percent, more preferably at least about 45 
percent, even more preferably at least about 60 percent and 

10 even more preferably at least about 75 percent, amino acid 
homology with the astacin domain of SEQ ID NO: 11 (i.e., with 
the corresponding regions of the astacin domain of SEQ ID 
NO: 11) . A more preferred astacin metalloendopeptidase protein 
of the present invention includes an amino acid sequence 

15 having at least about 40 percent, more preferably at least 
about 45 percent, even more preferably at least about 60 
percent and even more preferably at least about 75 percent, 
amino acid homology with the astacin domain of SEQ ID NO: 31 or 
SEQ ID NO: 34. An even more preferred astacin 

2 0 metalloendopeptidase protein of the present invention includes 
at least a portion of at least one of the following open 
reading frames: PDIMPAIorfi. PDIMPAIor^s/ PDIMPAIorfsa PDiMPA2oRFi, 
PDiMPA2oRF2A PDiMPA2oRF3A PDiMPA2oRF4A PDiMPA2oRF5A L3 PDiMPA3692 and 
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adult PDiMPA3676, the deduced amino acid sequences of which are 
disclosed, respectively, in SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID 
N0:5, SEQ ID N0:6, SEQ ID N0:7, SEQ ID N0:8, SEQ ID N0:9, SEQ 
ID NO: 10, SEQ ID NO: 31 and SEQ ID NO: 34, Preferred astacin 
5 metalloendopeptidase proteins of the present invention include 
an extended zinc-binding domain motif. More preferred astacin 
metalloendopeptidase proteins also contain the tyrosine zinc 
binding amino acid-containing domain as identified by Jiang et 
al., ibidf and disclosed above, 

10 Parasite astacin metalloendopeptidase protein homologues 

can be the result of natural allelic variation or natural 
mutation. Homologues can also be produced using techniques 
known in the art including, but not limited to, direct 
modifications to the protein or modifications to the gene 

15 encoding the protein using, for example, classic or 
recombinant DNA techniques to effect random or targeted 
mutagenesis. Isolated astacin metalloendopeptidase proteins 
of the present invention, including homologues, can be 
identified in a straight-forward manner by the proteins' 

2 0 ability to effect astacin metalloendopeptidase activity and/or 
to elicit an immune response against parasite astacin 
metalloendopeptidase proteins. Examples of such 

identification techniques are disclosed herein. 
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The minimum size of an isolated protein of the present 
invention is sufficient to form an epitope, a size that is 
typically at least from about 7 to about 9 amino acids • As is 
appreciated by those skilled in the art, an epitope can 
include amino acids that naturally are contiguous to each 
other as well as amino acids that, due to the tertiary 
structure of the natural protein, are in sufficiently close 
proximity to form an epitope. 

Any parasite astacin metalloendopeptidase is a suitable 
protein of the present invention. Suitable parasites from 
which to isolate proteins (including isolation of the natural 
protein or production of the protein by recombinant or 
synthetic techniques) include parasitic helminths, protozoa 
and ectoparasites such as, but not limited to nematodes, 
cestodes, trematodes, fleas, flies, ticks, lice, true bugs, 
and protozoa of the genera Babesia^ Cryptosporidium^ Eimeria^ 
Encephalitozoon^ Hepatozoon^ Isospora^ Leishmanial Neospora, 
Nosema^ Plasmodium^ Pneumocystis^ Theileria^ Toxoplasma and 
Trypanosoma. Preferred parasites include tissue-migrating 
parasitic helminths, and particularly tissue-migrating 
nematodes. Preferred nematodes include filariid, ascarid, 
strongyle and trichostrongyle nematodes • Particularly 
preferred tissue-migrating nematodes include parasites of the 
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genera Acanthocheilonemar Aelurostrongylus r Ancylostoma, 
Angiostrongylusr Ascaris, Brugia, Bunostomum, Dictyocaulus , 
Dioctophymer Dipetalonema, Dirofilaria, Dracunculus, 
Filaroidesr Lagochilascaris, Loa, Mansonella, Muellerius, 
Necator, Onchocerca, Parafilaria, Parascaris, Protostrongylus, 
Setariar Stephanofilariar Strongyloides, Strongylus, Thelazia, 
ToxascariSr Toxocara, Trichinella, Uncinaria and Wuchereria. 
Other particularly preferred nematodes include parasites of 
the genera Capillariar Chahertia, Cooperia, Enterohiusr 
Eaemonchusr Nematodirus, Oesophagostomum, Ostertagia, 
Trichostrongylus and Trichuris. Preferred filariid nematodes 
include Dirofilariar Acanthocheilonema, Brugia, Dipetalonema, 
Loa, Onchocerca, Parafilaria, Setaria, Stephanofilaria and 
Wuchereria filariid nematodes. A particularly preferred 
parasite is a nematode of the genus Dirofilaria, with D. 
immitis, the parasite that causes heartworm, being even more 
preferred. 

In accordance with the present invention, a mimetope of 
a protein refers to any compound that is able to mimic the 
activity of that protein, often because the mimetope has a 
structure that mimics the protein. For example, a mimetope of 
a parasite astacin metalloendopeptidase protein is a compound 
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that has an activity similar to that of an isolated parasite 
astacin metalloendopeptidase protein of the present invention. 
Mimetopes can be, but are not limited to: peptides that have 
been modified to decrease their susceptibility to degradation; 
anti-idiotypic and/or catalytic antibodies, or fragments 
thereof; non-proteinaceous immunogenic portions of an isolated 
protein (e.g., carbohydrate structures); and synthetic or 
natural organic molecules, including nucleic acids. Such 
mimetopes can be designed using computer-generated structures 
of proteins of the present invention. Mimetopes can also be 
obtained by generating random samples of molecules, such as 
oligonucleotides, peptides or other organic molecules, and 
screening such samples by affinity chromatography techniques 
using, for example, antibodies raised against a protein of the 
present invention . 

A preferred parasite astacin metalloendopeptidase protein 
or mimetope of the present invention is a compound that when 
administered to an animal in an effective manner, is capable 
of protecting that animal from disease caused by a parasite 
that is susceptible to treatment to inhibit astacin 
metalloendopeptidase activity. As such, the parasite 
preferably is essentially incapable of causing disease in an 
animal that is immunized with a parasite astacin 
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10 



metalloendopeptidase protein, and preferably with a D. immitis 
astacin metalloendopeptidase protein, of the present 
invention. In accordance with the present invention, the 
ability of a protein or mimetope of the present invention to 
protect an animal from disease by a parasite refers to the 
ability of that protein or mimetope to treat, ameliorate 
and/or prevent disease, including infection leading to 
disease, caused by the parasite, preferably by eliciting an 
immune response against the parasite. Such an immune response 
can include humoral and/or cellular immune responses. 
Suitable parasites to target include any parasite that is 
essentially incapable of causing disease in an animal 
administered a parasite astacin metalloendopeptidase protein 
of the present invention. As such, a parasite to target 
15 includes any parasite that produces a protein having one or 
more epitopes that can be targeted by a humoral and/or 
cellular response against a parasite astacin 
metalloendopeptidase protein of the present invention and/or 
that can be targeted by a compound that is capable of 
20 substantially inhibiting parasite astacin metalloendopeptidase 
activity, thereby resulting in the reduced ability of the 
parasite to cause disease in an animal. Suitable and 
preferred parasites to target include those parasites 
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disclosed above. A preferred class of parasites to target 
include tissue-migrating parasitic helminths. A particularly 
preferred nematode helminth to target is D. immitis, which 

causes heartworm. 

One embodiment of the present invention is a fusion 
protein that includes a parasite astacin metalloendopeptidase 
domain attached to a fusion segment. Inclusion of a fusion 
segment as part of a protein of the present invention can 
enhance the protein's stability during production, storage 
and/or use. Depending on the segment's characteristics, a 
fusion segment can also act as an immunopotentiator to enhance 
the immune response mounted by an animal immunized with a 
protein of the present invention that contains such a fusion 
segment. Furthermore, a fusion segment can function as a tool 
to simplify purification of a protein of the present 
invention, such as to enable purification of the resultant 
fusion protein using affinity chromatography. A suitable 
fusion segment can be a domain of any size that has the 
desired function (e.g., imparts increased stability to a 
protein, imparts increased immunogenicity to a protein, and/or 
simplifies purification of a protein) . It is within the scope 
of the present invention to use one or more fusion segments. 
Fusion segments can be joined to amino and/or carboxyl termini 
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of the astacin metalloendopeptidase-containing domain of the 
protein* Linkages between fusion segments and astacin 
metalloendopeptidase-containing domains of fusion proteins can 
be susceptible to cleavage in order to enable straight-forward 
recovery of the astacin metalloendopeptidase-containing 
domains of such proteins. Fusion proteins are preferably 
produced by culturing a recombinant cell transformed with a 
fusion nucleic acid molecule that encodes a protein including 
the fusion segment attached to either the carboxyl and/or 
amino terminal end of an parasite astacin 
metalloendopeptidase-containing domain. 

Preferred fusion segments for use in the present 
invention include a glutathione binding domain, such as 
Schistosoma japonicum glutathione-S-transf erase (GST) or a 
portion thereof capable of binding to glutathione; a metal 
binding domain, such as a poly-histidine segment capable of 
binding to a divalent metal ion; an immunoglobulin binding 
domain, such as Protein A, Protein G, T cell, B cell, Fc 
receptor or complement protein antibody-binding domains; a 
sugar binding domain such as a maltose binding domain from a 
maltose binding protein; and/or a "tag" domain (e.g., at least 
a portion of p-galactosidase, a strep tag peptide, other 
domains that can be purified using compounds that bind to the 
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domain, such as monoclonal antibodies) . More preferred fusion 
segments include metal binding domains, such as a poly- 
histidine segment; a maltose binding domain; a strep tag 
peptide, such as that available from Biometra in Tampa, FL; 
and an SIO peptide. An example of a preferred fusion protein 
of the present invention is PHlS-PDiMPA2804/ the production of 
which is disclosed herein. 

Another embodiment of the present invention is a parasite 
astacin metalloendopeptidase protein that also includes at 
least one additional protein segment that is capable of 
protecting an animal from one or more diseases. Such a 
multivalent protective protein can be produced by culturing a 
cell transformed with a nucleic acid molecule comprising two 
or more nucleic acid domains joined together in such a manner 
that the resulting nucleic acid molecule is expressed as a 
multivalent protective compound containing at least two 
protective compounds, or portions thereof, capable of 
protecting an animal from diseases caused, for example, by at 
least one infectious agent. 

Examples of multivalent protective compounds include, but 
are not limited to, a parasite astacin metalloendopeptidase 
protein attached to one or more other parasite proteins, such 
to a filariid nematode cysteine protease protein of the 
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present invention. Other examples of multivalent protective 
compounds include a parasite astacin metalloendopeptidase 
protein attached to one or more compounds protective against 
one or more other infectious agents, particularly an agent 
that infects cats or dogs^ such as, but not limited to, 
calicivirus, distemper virus, feline herpesvirus, feline 
immunodeficiency virus, feline leukemia virus, feline 
infectious peritonitis, hepatitis, hookworm, leptospirosis, 
panleukopenia virus, parvovirus, rabies and toxoplasmosis. 

Suitable heartworm multivalent protective proteins 
include, but are not limited to, a D. immitis astacin 
metalloendopeptidase and/or a D. immitis cysteine protease of 
the present invention attached to at least one other D. 
immitis protein such as, but not limited to, a D. immitis Gp29 
protein, a D. immitis P39 protein, a D. immitis P22U protein, 
a D. immitis P22L protein, a D. immitis P20.5 protein, a D. 
immitis P4 protein, a D. immitis Di22 protein and/or a D. 
immitis protease expressed in L3 and/or L4 larvae, as well as 
other helminth proteins sharing significant homology with such 
D. immitis proteins. A protein sharing significant homology 
with another protein refers to the ability of the nucleic acid 
sequences encoding such proteins to form, stable hybridization 
complexes with each other under stringent hybridization 
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conditions, as described, for example, in Sambrook et al,, 
ibid. U.S. Patent Application Serial No, 08/208,885, filed 
March 8, 1994, entitled "D. immitis Gp29 Proteins, Nucleic 
Acid Molecules and Uses Thereof", discloses D, immitis Gp29 
5 proteins and nucleic acid molecules that encode them. U.S. 
Patent Application Serial No. 08/003,389, filed January 12, 
1993, entitled "Immunogenic Larval Proteins", discloses a 39- 
kD (kilodalton) D. immitis protein (size determined by Tris 
glycine SDS-PAGE (sodium dodecyl sulfate polyacrylamide gel 

10 electrophoresis)), referred to herein as P39, and a nucleic 
acid sequence that encodes it. U.S. Patent Application Serial 
No. 08/003,257, filed January 12, 1993, entitled "Reagents and 
Methods for Identification of Vaccines", discloses 22-kD and 
20.5"kD D. immitis proteins (sizes determined by Tris glycine 

15 SDS-PAGE), referred to herein as P22L and P20.5, and nucleic 
acid sequences that encode them. U.S. Patent Application 
Serial No. 08/109,391, filed August 19, 1993, entitled "Novel 
Parasitic Helminth Proteins", discloses D, immitis P4 and D. 
immitis P22U, as well as nucleic acid sequences that encode 

20 them. U.S. Patent Application Serial No. 08/060,500, filed 
May 10, 1993, entitled "Heartworm Vaccine", discloses a D. 
immitis Di22 protein and a nucleic acid sequence encoding it 
(included in Genbank data base accession number M82811) ; 
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Serial No. 08/060^500 is a continuation of U.S. Patent 
Application Serial No. 07/683,202, filed April 8, 1991. U.S. 
Patent Application Serial No. 08/153,554, filed November 16, 
1993, entitled "Protease Vaccine Against Heartworm", discloses 
5 D. immitis larval proteases; Serial No. 08/153,554 is a 
continuation of U.S. Patent Application Serial No. 07/792,209, 
filed November 12, 1991. Each of these patent applications is 
incorporated by reference herein in its entirety. 

A particularly preferred parasite astacin 

10 metalloendopeptidase protein is a protein encoded by at least 
a portion of SEQ ID N0:1, SEQ ID N0:2, SEQ ID NO:29, SEQ ID 
NO: 30, SEQ ID NO: 32 and SEQ ID NO: 33, and, as such, is a 
protein having an amino acid sequence encoded by at least a 
portion of at least one of the open reading frames encoding an 

15 amino acid sequence represented by SEQ ID NO: 3, SEQ ID NO: 4, 
SEQ ID N0:5, SEQ ID N0:6, SEQ ID N0:7, SEQ ID N0:8, SEQ ID 
NO: 9, SEQ ID NO: 10, SEQ ID NO: 31 and/or SEQ ID NO: 34. 
Preferred proteins can include combinations of amino acid 
sequences encoded by such reading frames, such as SEQ ID 

20 NO: 11, that result in a functional protein. A homology search 
using these open reading frames indicate that all but SEQ ID 
NO: 10 share significant homology with known members of the 
astacin metalloendopeptidase family and with a Caenorhabditis 
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elegans R151.5 gene product, Genbank accession number U00036 
(Wilson et al., 1994, Nature 368, 32-38), suggesting that C. 
eiegans also encodes a astacin metalloendopeptidase • 
Particularly well conserved are the extended zinc-binding 
5 domain motif and the tyrosine-containing domain motif, the 
overall sequence homology being about 24 percent* Even more 
preferred proteins are encoded by single reading frames which 
encode a protein having an amino acid sequence of SEQ ID NO: 31 
or SEQ ID NO: 34. A homology search using these open reading 

10 frames indicate that they also share significant homology with 
known members of the astacin metalloendopeptidase family and 
with a Caenorhabditis elegans R151,5 gene product, Genbank 
accession number U00036 (Wilson et al., supra). Again, 
particularly well conserved are the extended zinc-binding 

15 domain motif and the tyrosine-containing domain motif, the 
overall sequence homology being about 34,5 percent. 

Particularly preferred proteins of the present invention 
include proteins having the astacin domain of SEQ ID NO: 11, 
proteins having the astacin domain of SEQ ID NO: 31, proteins 

2 0 having the astacin domain of SEQ ID NO: 34, proteins that 
include these domains (such as, but not limited to, full- 
length proteins, fusion proteins and proteins providing 
multivalent protection) and proteins that are truncated 
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hoitiologues of these domains. Even more preferred proteins 
include PDiMPA2804. PHIS-PDiMPA2804/ L3 PDiMPASggs. and adult 
PDiMPABg.g. 

Another embodiment of the present invention is an 
5 isolated parasite nucleic acid molecule capable of 
hybridizing, under stringent conditions, with a D. immitis 
astacin metalloendopeptidase gene. As used herein, a D, 
immitis astacin metalloendopeptidase gene includes all nucleic 
acid sequences related to a natural D. immitis astacin 

10 metalloendopeptidase gene such as regulatory regions that 
control production of a D. immitis astacin 
metalloendopeptidase protein encoded by that gene (such as, 
but not limited to, transcription, translation or post- 
translation control regions) as well as the coding region 

15 itself. A parasite ^astacin metalloendopeptidase nucleic acid 
molecule of the present invention can include any isolated 
natural parasite astacin metalloendopeptidase gene or a 
homologue thereof. A nucleic acid molecule of the present 
invention can include one or more regulatory regions, full- 

20 length or partial coding regions, or combinations thereof. 
The minimal size of a parasite astacin metalloendopeptidase 
nucleic acid molecule of the present invention is the minimal 
size capable of forming a stable hybrid under stringent 

37 



2618-21-1'C1 

hybridization conditions. Suitable and preferred parasites 
are disclosed above. 

In accordance with the present invention, an isolated 
nucleic acid molecule is a nucleic acid molecule that has been 
5 removed from its natural milieu (i.e., that has been subject 
to human manipulation) . As such, "isolated" does not reflect 
the extent to which the nucleic acid molecule has been 
purified. An isolated nucleic acid molecule can include DNA, 
RNA, or derivatives of either DNA or RNA. 

10 An isolated parasite astacin metalloendopeptidase nucleic 

acid molecule of the present invention can be obtained from 
its natural source either as an entire (i.e., complete) gene 
or a portion thereof capable of forming a stable hybrid with 
that gene. An isolated parasite astacin metalloendopeptidase 

15 nucleic acid molecule can also be produced using recombinant 
DNA technology (e.g., polymerase chain reaction (PGR) 
amplification, cloning) or chemical synthesis. Isolated 
parasite astacin metalloendopeptidase nucleic acid molecules 
include natural nucleic acid molecules and homologues thereof, 

20 including, but not limited to, natural allelic variants and 
modified nucleic acid molecules in which nucleotides have been 
inserted, deleted, substituted, and/or inverted in such a 
manner that such modifications do not substantially interfere 
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with the nucleic acid molecule's ability to encode a parasite 
astacin metalloendopeptidase protein of the present invention 
or to form stable hybrids under stringent conditions with 
natural isolates, 
5 A parasite astacin metalloendopeptidase nucleic acid 

molecule homologue can be produced using a number of methods 
known to those skilled in the art (see,, for example, Sambrook 
et al., ibid.). For example, nucleic acid molecules can be 
modified using a variety of techniques including, but not 

10 limited to, classic mutagenesis techniques and recombinant DNA 
techniques, such as site-directed mutagenesis, chemical 
treatment of a nucleic acid molecule to induce mutations, 
restriction enzyme cleavage of a nucleic acid fragment, 
ligation of nucleic acid fragments, polymerase chain reaction 

15 (PGR) amplification and/or mutagenesis of selected regions of 
a nucleic acid sequence, synthesis of oligonucleotide mixtures 
and ligation of mixture groups to "build" a mixture of nucleic 
acid molecules and combinations thereof. Nucleic acid 
molecule homologues can be selected from a mixture of modified 

20 nucleic acids by screening for the function of the protein 
encoded by the nucleic acid (e.g., astacin 
metalloendopeptidase activity or ability to elicit an immune 
response against at least one epitope of a parasite astacin 
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metalloendopeptidase protein) and/or by hybridization with 
isolated parasite astacin metalloendopeptidase nucleic acids 
under stringent conditions. 

An isolated nucleic acid molecule of the present 
invention can include a nucleic acid sequence that encodes at 
least one parasite astacin metalloendopeptidase protein of the 
present invention, examples of such proteins being disclosed 
herein. Although the phrase "nucleic acid molecule" primarily 
refers to the physical nucleic acid molecule and the phrase 
"nucleic acid sequence" primarily refers to the sequence of 
nucleotides on the nucleic acid molecule, the two phrases can 
be used interchangeably, especially with respect to a nucleic 
acid molecule, or a nucleic acid sequence, being capable of 
encoding a protein. As heretofore disclosed, parasite astacin 
metalloendopeptidase proteins of the present invention 
include, but are not limited to, proteins having full-length 
astacin metalloendopeptidase coding regions, proteins having 
partial astacin metalloendopeptidase coding regions, fusion 
proteins, multivalent protective proteins and combinations 
thereof. 

One embodiment of the present invention is a parasite 
astacin metalloendopeptidase nucleic acid molecule that is 
capable of hybridizing under stringent conditions with nucleic 
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acid molecule nDiMPAl-L299/ with nucleic acid molecule 
nDiMPA22i26A with nucleic acid molecule L3 nDiMPA32292^ with 
nucleic acid molecule L3 nDiMPA32076/ ^ith nucleic acid 
molecule adult nDiMPA32032/ and/or with nucleic acid molecule 
5 adult nDiMPA32028 • such, preferred astacin 

metalloendopeptidase nucleic acid molecules are capable of 
forming stable hybrids with nucleic acid molecules represented 
by SEQ ID N0:1, SEQ ID N0:2, SEQ ID NO:29, SEQ ID NO:30, SEQ 
ID NO: 32 and/or SEQ ID NO: 33, Particularly preferred astacin 

10 metalloendopeptidase nucleic acid molecules comprise at least 
a portion of nucleic acid molecule nDiMPAl-Lsgg^ nucleic acid 
molecule nDiMPA22i26/ nucleic acid molecule L3 nDiMPA32292^ 
nucleic acid molecule L3 nDiMPA32076^ adult nDiMPA32032/ and/or 
adult nDiMPA32028 - such, a preferred nucleic acid molecule 

15 of the present invention includes a nucleic acid sequence 
including at least a portion of SEQ ID N0:1, SEQ ID NO: 2, SEQ 
ID N0:29, SEQ ID NO:30, SEQ ID NO:32 and/or SEQ ID N0:33. 
Such a nucleic acid molecule can be nDiMPAl-L2g9/ nDiMPA22i26/ ^3 
nDiMPA32292 / L3 nDiMPA32076/ adult nDiMPA32032^ and/or adult 

2 0 nDiMPA32028A can include nucleotides in addition to nDiMPAli299/ 
nDiMPA22i26/ L3 nDiMPA32292/ L3 nDiMPA32076f adult nDiMPA32032/ 
and/or adult nDiMPA32028^ or can be a truncation fragment of 
nDiMPAla299/ nDiMPA22i26/ L3 nDiMPA32292/ L3 nDiMPA32076/ adult 



41 



2618-21-1-C1 

nDiMPA3 2032' and/or adult nDiMPASaoas- Particularly preferred 
nucleic acid molecules include nDiMPAlggg/ nDiMPAli299' 
nDiMPA22i26f nDiMPA2804^ nDiMPA227i, L3 nDiMPA32292A L3 nDiMPASzoTe/ 
adult nDiMPA32032, adult nDiMPA32028/ and BvMPA2, the production 
of which are disclosed in the Examples. 

One preferred embodiment of the present invention is a 
parasite astacin metalloendopeptidase nucleic acid molecule 
capable of hybridizing to the complement of the coding strand 
of a nucleic acid molecule encoding at least one open reading 
frame encoding at least one of the following amino acid 
sequences: SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 
SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID 
NO: 11, SEQ ID NO: 31 and SEQ ID NO: 34. Preferably, such a 
nucleic acid molecule encodes a protein that shares at least 
about 35 percent, more preferably at least about 45 percent, 
even more preferably at least about 60 percent and even more 
preferably at least about 75 percent amino acid homology with 
SEQ ID NO: 11. Even more preferably, such a nucleic acid 
molecule encodes a protein that shares at least about 40 
percent, more preferably at least about 45 percent, even more 
preferably at least about 60 percent and even more preferably 
at least about 75 percent amino acid homology with SEQ ID 
NO: 31 and/or SEQ ID NO: 34. More preferred astacin 
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metalloendopeptidase nucleic acid molecules encode at least a 
portion of at least one of such open reading frames. 
Particularly preferred astacin metalloendopeptidase nucleic 
acid molecules are capable of forming stable hybrids with 
nucleic acid molecules encoding an extended zinc-binding 
domain motif (i.e., to the carboxyl terminus of the motif as 
well as to the general zinc-binding domain) and, more 
preferably, also with nucleic acid molecules encoding other 
disclosed conserved domains of astacin metalloendopeptidases, 
such as the motif that contains the tyrosine believed to bind 
to zinc. 

Knowing the nucleic acid sequence of certain parasite 
astacin metalloendopeptidase nucleic acid molecules of the 
present invention allows one skilled in the art to make copies 
of those nucleic acid molecules as well as to obtain nucleic 
acid molecules including at least a portion of such nucleic 
acid molecules and other parasite astacin metalloendopeptidase 
nucleic acid molecule homologues. Such nucleic acid molecules 
can be obtained in a variety of ways including screening 
appropriate expression libraries with antibodies of the 
present invention; traditional cloning techniques using 
oligonucleotide probes of the present invention to screen 
appropriate libraries or DNA; and PGR amplification of 
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appropriate libraries or DNA using oligonucleotide primers of 
the present invention. Preferred libraries to screen or from 
which to amplify include parasite larval (especially L3, L4) 
and adult cDNA libraries as well as genomic DNA libraries. 
Similarly^ preferred DNA sources to screen or from which to 
amplify include parasite larval (especially L3, L4) and adult 
cDNA and genomic DNA. Techniques to clone and amplify genes 
are disclosed, for example, in Sambrook et al., ibid. 

The present invention also includes nucleic acid 
molecules that are oligonucleotides capable of hybridizing, 
under stringent conditions, with complementary regions of 
other, preferably longer, nucleic acid molecules of the 
present invention, such as to complementary regions of a 
parasite astacin metalloendopeptidase gene, including 
complementary regions of a D. immitis astacin 
metalloendopeptidase gene. Such oligonucleotides can 

hybridize under stringent conditions with complementary 
regions of nDiMPA1^299A nDiMPA22i26/ L3 nDiMPA32292^ L3 nDiMPA32076^ 
adult nDiMPA32032A or adult nDiMPA32028A complementary regions of 
nucleic acid molecules that include at least a portion of 
nDiMPAl,299. nDiMPA22i26/ L3 nDiMPA32292/ L3 nDiMPA32076f adult 
nDiMPA32032/ or adult nDiMPA32028/ and complementary regions of 
nucleic acid molecules that hybridize under stringent 
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conditions with nDiMPAl^sgg/ nDiMPA22i26/ L3 nDiMPA32292/ L3 
nDiMPA32076/ adult nDiMPA32032/ or adult nDiMPA32028 • Such 
oligonucleotides can be RNA, DNA, or derivatives of either. 
The minimal size of such oligonucleotides is the size required 
to form a stable hybrid between a given oligonucleotide and 
the complementary sequence on another nucleic acid molecule of 
the present invention. As such, the size is dependent on 
nucleic acid composition and percent homology between the 
oligonucleotide and complementary sequence as well as upon 
hybridization conditions per se (e.g., temperature, salt 
concentration) . For AT-rich nucleic acid sequences, such as 
those of parasitic helminths, oligonucleotides typically are 
at least about 15 to about 17 bases in length. The size of 
the oligonucleotide must also be sufficient for the use of the 
oligonucleotide in accordance with the present invention. 
Oligonucleotides of the present invention can be used in a 
variety of applications including, but not limited to, as 
probes to identify additional nucleic acid molecules, as 
primers to amplify or extend nucleic acid molecules or in 
therapeutic applications to inhibit, for example, expression 
of astacin metalloendopeptidases by a parasite. Such 
therapeutic applications include the use of such 
oligonucleotides in, for example, antisense-, triplex 
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formation-, ribozyme- and/or RNA drug-based technologies. The 
present invention, therefore, includes such oligonucleotides 
and methods to interfere with the production of astacin 
metalloendopeptidase proteins by use of one or more of such 
technologies • Appropriate oligonucleotide-containing 

therapeutic compositions can be administered to an animal, 
using techniques known to those skilled in the art, either 
prior to or after infection by a parasite such as D, imaitisr 
in order to protect the animal from disease. 

Another embodiment of the present invention is an 
isolated filariid nematode cysteine protease protein or a 
mimetope thereof. As used herein, an isolated filariid 
nematode, or filariid, cysteine protease protein can be a 
full-length filariid cysteine protease protein or any 
homologue of such a protein. Filariid nematode cysteine 
protease proteins, including homologues thereof, can be 
isolated and produced according to the methods disclosed 
herein for parasite astacin metalloendopeptidase proteins. 
Homologues and mimetopes of filariid cysteine protease 
proteins are defined in a similar manner as are homologues and 
mimetopes of parasite astacin metalloendopeptidase proteins. 
Filariid cysteine protease proteins (including homologues) and 
mimetopes thereof each are capable of eliciting an immune 
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response (i.e,^ having at least one epitope capable of 
eliciting an immune response) against a filariid cysteine 
protease protein and/or are capable of effecting cysteine 
protease activity. Cysteine protease activity, as well as the 
ability of a protein to effect an immune response, can be 
measured using techniques known to those skilled in the art. 
Cysteine protease activity can be measured by its ability to 
cleave peptides having a cysteine protease cleavage site, such 
as z-Val-Leu-Arg-AMC; such activity can be inhibited by, for 
example, by the cysteine protease inhibitor E-64 (available 
from Boehringer Mannheim, Indianapolis, IN) . Preferred 
filariids are disclosed herein. A particularly preferred 
filariid is D. immitis. 

A filariid cysteine protease protein of the present 
invention, including any homologue thereof, has the additional 
characteristic of being encoded by a nucleic acid molecule 
that is capable of hybridizing, under stringent conditions, 
with the complement of the coding strand of a nucleic acid 
molecule comprising at least a portion of a nucleic acid 
sequence encoding a filariid cysteine protease protein. 
Preferred proteins are encoded by a nucleic acid molecule that 
forms stable hybrids with at least a portion of nDiCP:i43, the 
production of which is described in detail in the Examples. 
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SEQ ID NO: 12 represents the deduced sequence of nDiCPi43, the 
deduced translation product of which is a 47 amino acid 
sequence represented in SEQ ID NO: 13, the protein being 
denoted PDiCPi43. It should be noted that since nucleic acid 
sequencing technology is not entirely error-free, SEQ ID 
NO: 12, at best, represents an apparent nucleic acid sequence 
of a nucleic acid molecule encoding at least a portion of 
full-length D. immitis cysteine protease. Furthermore, SEQ ID 
NO: 13 apparently represents an internal amino acid sequence of 
D. immitis cysteine protease since SEQ ID NO: 12 apparently has 
neither a start nor stop codon. SEQ ID NO: 13, however, 
includes amino acid sequences that are conserved among a 
number of cysteine proteases. 

A comparison of SEQ ID NO: 13 with the corresponding 
regions of known parasite cysteine protease genes indicates 
that SEQ ID NO: 13 shares about 16 percent, about 22 percent, 
about 24 percent, about 35 percent, about 39 percent, about 44 
percent and about 49 percent homology at the amino acid level 
with cysteine proteases from, respectively, H. contortus (a 
nematode) , Schistosoma mansoni (a trematode) , C. elegans (a 
nematode) , Fasciola hepatica (a trematode) , Entamoeba 
histolytica (a protozoa) , Trypanosoma cruzi (a protozoa) and 
T. hrucie. SEQ ID NO: 13 also shares about 50 percent amino 
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acid homology with human cathepsin L, about 45 percent amino 
acid homology with chicken cathepsin L and about 56 percent 
amino acid homology with a Paragonimus westermani (trematode) 
cysteine protease. The serine at about position 30 and the 
cysteine at about position 37 of SEQ ID NO: 13 are conserved in 
all of these cysteine proteases. Please see, for example, the 
following for listings of the above-referenced sequences: 
Heussler et al . , 1994>. Mol. Biochem. Parasitol. 64, 11-23; 
Eakin et al . , 1990, Mol. Biochem. Parasitol. 39, 1-8; Ray et 
al., ibid., Pratt et al,, ibid., Sakanari et al,, ibid.; 
European Patent Application Publication No. 0524834A2, by 
Hamajima et al., published January 27, 1993. 

Preferred filariid cysteine protease proteins of the 
present invention include amino acid sequences that share at 
least about 60 percent, more preferably at least about 70 
percent, and even more preferably at least about 80 percent, 
homology with SEQ ID NO: 13, Particularly preferred filariid 
cysteine protease proteins of the present invention include 
PDiCPi42, proteins that include PDiCP^42 (including, but not 
limited to full-length proteins, fusion proteins and 
multivalent protective proteins), and proteins that include at 
least a portion of PDiCPi42- 
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A preferred filariid cysteine protease protein or 
mimetope thereof is a compound that when administered to an 
animal in an effective manner^ is capable of protecting that 
animal from disease caused by a parasite that is susceptible 
to treatment with a composition that inhibits cysteine 
protease activity. A suitable parasite to target is any 
parasite that produces a protein having one or more epitopes 
that can be targeted by a humoral and/or cellular response 
against a filariid nematode cysteine protease protein of the 
present invention and/or that can be targeted by a compound 
that is capable of substantially inhibiting filariid cysteine 
protease activity, thereby resulting in the reduced ability of 
the parasite to cause disease in an animal. Suitable and 
preferred parasites are disclosed above, A preferred class of 
parasites to target include tissue-migrating parasitic 
helminths. A particularly preferred nematode to target is D. 
immi tls, which causes heartworm. 

Also included in the present invention are fusion 
proteins and multivalent protective proteins that include at 
least one filariid cysteine protease protein. Such proteins 
can comprise fusion segments and/or multiple protective 
domains similar as disclosed for parasite astacin 
metalloendopeptidase proteins and can be produced in a similar 
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manner as described for parasite astacin metalloendopeptidase 
proteins of the present invention. Particularly preferred 
fusion proteins include PHIS-PDiCPi42 * 

Yet another embodiment of the present invention is an 
isolated filariid nematode nucleic acid molecule capable of 
hybridizing, under stringent conditions, with a D. immitis 
cysteine protease gene. Such a nucleic acid molecule is 
referred to as a filariid nematode, or filariid, cysteine 
protease nucleic acid molecule. As used herein, a filariid 
cysteine protease gene includes all nucleic acid sequences 
related to a natural filariid cysteine protease gene such as 
regulatory regions that control production of a D. immitis 
cysteine protease protein encoded by that gene as well as the 
coding region itself. A filariid cysteine protease nucleic 
acid molecule of the present invention can include an isolated 
natural filariid cysteine protease gene or a homologue 
thereof. A nucleic acid molecule of the present invention can 
include one or more regulatory regions, full-length or partial 
coding regions, or combinations thereof. The minimal size of 
a filariid cysteine protease nucleic acid molecule of the 
present invention is the minimal size capable of forming a 
stable hybrid under stringent hybridization conditions with a 
D. immitis cysteine protease gene. Filariid immitis cysteine 



2618-21'1-C1 

protease nucleic acid molecules can be isolated and produced 
according to the methods taught herein for the production and 
isolation of parasite astacin metalloendopeptidase nucleic 
acid molecules. Cysteine protease nucleic acid molecule 
homologues can be selected from a mixture of modified nucleic 
acids by screening for the function of the protein encoded by 
the nucleic acid (e.g., cysteine protease activity and/or 
ability to elicit an immune response against at least one 
epitope of a filariid cysteine protease protein) and/or by 
hybridization with isolated D. immitis cysteine protease 
nucleic acids under stringent conditions. 

An isolated filariid cysteine protease nucleic acid 
molecule of the present invention can include a nucleic acid 
sequence that encodes at least one filariid cysteine protease 
protein of the present invention, examples of such proteins 
being disclosed herein. As heretofore disclosed, filariid 
cysteine protease proteins of the present invention include, 
but are not limited to, proteins having full-length filariid 
cysteine protease coding regions, proteins having partial 
filariid cysteine protease coding regions, fusion proteins, 
multivalent protective proteins and combinations thereof. The 
present invention also includes nucleic acid molecules 
encoding filariid cysteine protease proteins that have been 
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modified to accoinmodate codon usage properties of the cells in 
which such nucleic acid molecules are to be expressed. 

One embodiment of the present invention is a filariid 
cysteine protease nucleic acid molecule that includes a 
5 nucleic acid sequence that is capable of hybridizing under 
stringent conditions with D, immitis nucleic acid molecule 
nDiCPi43, the deduced sequence of which is disclosed in SEQ ID 
NO: 12, Preferred filariid cysteine protease nucleic acid 
molecules encode proteins having at least about 60 percent, 

10 more preferably at least about 70 percent and even more 
preferably at least about 80 percent, amino acid homology with 
SEQ ID NO: 13, More preferred is a nucleic acid molecule that 
encodes a D. iwmitis cysteine protease protein that comprises 
at least a portion of SEQ ID NO: 13. 

15 A preferred nucleic acid molecule of the present 

invention includes at least a portion of D. immitis nucleic 
acid molecule nDiCPi43, Such a nucleic acid molecule can be 
nDiCP;i43, can include nucleotides in addition to nDiCPi43 (such 
as, but not limited to, a nucleic acid molecule encoding a 

20 full-length protein, a nucleic acid molecule encoding a fusion 
protein, or a nucleic acid molecule encoding a multivalent 
protective compound) , or can be a truncation fragment of 
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nDiCPi43. Particularly preferred filariid cysteine protease 
nucleic acid molecules include nDiCPi43 and nDiCPi42. 

The inventors of the present invention had difficulty 
isolating a D. immitis cysteine protease nucleic acid molecule 
5 despite the wide variety of cysteine protease genes that have 
been cloned. Primers designed by the inventors using 
consensus sequences derived from known cysteine protease 
genes, including from known parasite cysteine protease genes yr 
had a degeneracy that was essentially too great to pull out a 

10 D. immitis cysteine protease gene. The inventors discovered 
that use of primers that incorporated D. immitis codon usage 
bias enabled the identification of the first D. immitis 
cysteine protease nucleic acid molecule, namely nDiCPa43. 

Having identified the nucleic acid molecule nDiCPi43, it 

15 is likely that one skilled in the art can make copies of that 
nucleic acid molecule as well as obtain other filariid 
nematode cysteine protease nucleic acid molecules including 
full-length genes and homologues thereof. Such nucleic acid 
molecules can be obtained in a variety of ways such as those 

2 0 described for the isolation and production of parasite astacin 
metalloendopeptidase nucleic acid molecules of the present 
invention. Preferred libraries to screen or from which to 
amplify include filariid larval (especially L3, L4) and adult 
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cDNA libraries and filariid genomic DNA libraries. Similarly, 
preferred DNA sources to screen or from which to amplify 
include filariid larval (especially L3, L4) and adult cDNA, as 
well as filariid genomic DNA, Preferred primers and probes to 
use are codon-biased for the given filariid. 

The present invention also includes nucleic acid 
molecules that are oligonucleotides capable of hybridizing, 
under stringent conditions, with complementary regions of 
other, preferably longer, nucleic acid molecules of the 
present invention, such as to complementary regions of a 
filariid cysteine protease gene. Such oligonucleotides can 
hybridize under stringent conditions with complementary 
regions of nDiCPi43, complementary regions of nucleic acid 
molecules that include at least a portion of nDiCPi43, and 
complementary regions of nucleic acid molecules that hybridize 
under stringent conditions with nDiCPi43. Such 
oligonucleotides can be RNA, DNA, or derivatives of either. 
Other criteria, such as minimal size, as well as methods to 
produce and use such oligonucleotides are as disclosed for 
parasite astacin metalloendopeptidase oligonucleotides of the 
present invention . 

The present invention also includes a recombinant vector, 
which includes at least one nucleic acid molecule of the 
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present invention (e.g,, a parasite astacin 
metalloendopeptidase nucleic acid molecule and/or a filariid 
cysteine protease nucleic acid molecule, examples of which are 
disclosed herein) inserted into any vector capable of 
5 delivering the nucleic acid molecule into a host cell. Such 
a vector contains heterologous nucleic acid sequences, that is 
nucleic acid sequences that are not naturally found adjacent 
to nucleic acid molecules of the present invention and that 
preferably are derived from a species other than the species 

10 from which the nucleic acid molecule (s) are derived. The 
vector can be either RNA or DNA, either prokaryotic or 
eukaryotic, and typically is a virus or a plasmid. 
Recombinant vectors can be used in the cloning, sequencing, 
and/or otherwise manipulating of parasite, including D. 

15 immitis, nucleic acid molecules of the present invention. One 
type of recombinant vector, herein referred to as a 
recombinant molecule and described in more detail below, can 
be used in the expression of nucleic acid molecules of the 
present invention. Preferred recombinant vectors are capable 

20 of replicating in the transformed cell. 

Preferred nucleic acid molecules to include in 
recombinant vectors of the present invention include at least 
one of the following: a nucleic acid molecule that includes at 
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least a portion of nDiMPAli299/ a nucleic acid molecule that 
includes at least a portion of nDiMPA22i26/ a nucleic acid 
molecule that includes at least a portion of L3 nDiMPA32292^ ^ 
nucleic acid molecule that includes at least a portion of L3 
5 nDiMPA32076A a nucleic acid molecule that includes at least a 
portion of adult nDiMPA32032/ a nucleic acid molecule that 
includes at least a portion of adult nDiMPA32028/ or a nucleic 
acid molecule that includes at least a portion of nDiCPi43, 
Particularly preferred nucleic acid molecules to include in 

10 recombinant vectors of the present invention include 
nDiMPAlg89/ nDiMPAl:L299/ nDiMPA22i26/ nDiMPA2804/ nDiMPA227iA L3 
nDiMPA32292^ nDiMPA32076/ adult nDiMPA32032/ adult nDiMPA32028A 
BVMPA2 nDiCPi43 and nDiCPi42. 

Isolated proteins of the present invention can be 

15 produced in a variety of ways, including production and 
recovery of natural proteins, production and recovery of 
recombinant proteins, and chemical synthesis of the proteins. 
In one embodiment, an isolated protein of the present 
invention is produced by culturing a cell capable of 

20 expressing the protein under conditions effective to produce 
the protein, and recovering the protein. A preferred cell to 
culture is a recombinant cell that is capable of expressing 
the protein, the recombinant cell being produced by 



57 



2618-21-1-C1 

transforming a host cell with one or more nucleic acid 
molecules of the present invention. Transformation of a 
nucleic acid molecule into a cell can be accomplished by any 
method by which a nucleic acid molecule can be inserted into 
5 the cell. Transformation techniques include^ but are not 
limited to, transf ection, electroporation, microinjection, 
lipofection, adsorption, and protoplast fusion. A recombinant 
cell may remain unicellular or may grow into a tissue, organ 
or a multicellular organism. Transformed nucleic acid 

10 molecules of the present invention can remain extrachromosomal 
or can integrate into one or more sites within a chromosome of 
the transformed (i.e., recombinant) cell in such a manner that 
their ability to be expressed is retained. A preferred 
nucleic acid molecule with which to transform a cell is a 

15 nucleic acid molecule that includes a parasite astacin 
metalloendopeptidase nucleic acid molecule and/or a filariid 
cysteine protease nucleic acid molecule of the present 
invention. Particularly preferred nucleic acid molecules with 
which to transform cells include nDlMPAlggg (characterized by 

20 a coding strand having the nucleic acid sequence of SEQ ID 
NO:20), nDiMPAl-L299 (characterized by a coding strand having 
the nucleic acid sequence of SEQ ID N0:1), nDiMPA22i26 
(characterized by a coding strand having the nucleic acid 
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sequence of SEQ ID N0:2), nDiMPA2eo4 (characterized by a coding 
strand having the nucleic acid sequence of SEQ ID N0:21), 
nDiMPA227i (characterized by a coding strand having the nucleic 
acid sequence of SEQ ID NO:22), L3 nDiMPA32292 (characterized 
5 by a coding strand having the nucleic acid sequence of SEQ ID 
NO:29), L3 nDiMPAsove (characterized by a coding strand having 
the nucleic acid sequence of SEQ ID NO: 30), adult nDiMPA32032 
(characterized by a coding strand having the nucleic acid 
sequence SEQ ID NO:32), adult nDiMPA32028 (characterized by a 
y 10 coding strand having the nucleic acid sequence SEQ ID NO:33), 
y BvMPA2, nDiCPi43 (characterized by a coding strand having the 

nucleic acid sequence of SEQ ID NO: 12) and nDiCPi42 
^: (characterized by a coding strand having the nucleic acid 

sequence of SEQ ID NO: 23) . 
15 Suitable host cells to transform include any cell that 

^H,;: can be transformed. Host cells can be either untransf ormed 

^ cells or cells that are already transformed with at least one 

nucleic acid molecule. Host cells of the present invention 
either can be endogenously (i-e,, naturally) capable of 
20 producing parasite, including D, imaitis, proteins of the 
present invention or can be capable of producing such proteins 
after being transformed with at least one nucleic acid 
molecule of the present invention. Host cells of the present 
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invention can be any cell capable of producing at least one 
protein of the present invention, including bacterial, fungal 
(including yeast) , animal parasite (including helminth, 
protozoa and ectoparasite), insect, animal and plant cells. 
5 Preferred host cells include bacterial, mycobacterial, yeast, 
helminth, insect and mammalian cells. More preferred host 
cells include Salmonella^ Escherichia, Bacillus, 
Saccharomyces, Spodopteraf Mycobacteria, Trichoplusia, BHK 
(baby hamster kidney) cells, MDCK cells (normal dog kidney 

10 cell line for canine herpesvirus cultivation) , CRFK cells 
(normal cat kidney cell line for feline herpesvirus 
cultivation) and COS cells. Particularly preferred host cells 
are Escherichia coli, including E. coli K-12 derivatives; 
Salmonella typhi; Salmonella typhimurium, including attenuated 

15 strains such as UK-1 ^3987 and SR-11 40^72; Spodoptera 
frugiperda; Trichoplusia nij BHK cells; MDCK cells; CRFK cells 
and non-tumorigenic mouse myoblast G8 cells (e.g., ATCC CRL 
1246) . Additional appropriate mammalian cell hosts include 
other kidney cell lines (e.g., CV-1 monkey kidney cell lines), 

20 other fibroblast cell lines (e.g., human, murine or chicken 
embryo fibroblast cell lines) , myeloma cell lines, Chinese 
hamster ovary cells and/or HeLa cells . 
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A recombinant cell is preferably produced by transforming 
a host cell with one or more recombinant molecules, each 
comprising one or more nucleic acid molecules of the present 
invention operatively linked to an expression vector 
5 containing one or more transcription control sequences. The 
phrase operatively linked refers to insertion of a nucleic 
acid molecule into an expression vector in a manner such that 
the molecule is able to be expressed when transformed into a 
host cell. As used herein, an expression vector is a DNA or 

10 RNA vector that is capable of transforming a host cell and of 
effecting expression of a specified nucleic acid molecule. 
Preferably, the expression vector is also capable of 
replicating within the host cell. Expression vectors can be 
either prokaryotic or eukaryotic, and are typically viruses or 

15 plasmids. Expression vectors of the present invention include 
any vectors that function (i,e,, direct gene expression) in 
recombinant cells of the present invention, including in 
bacterial, fungal, animal parasite, insect, animal, and plant 
cells. Preferred expression vectors of the present invention 

20 can direct gene expression in bacterial, yeast, helminth or 
other parasite, insect and mammalian cells and more preferably 
in the cell types heretofore disclosed. 
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Expression vectors of the present invention may also (a) 
contain secretory signals (i.e*^. signal segment nucleic acid 
sequences) to enable an expressed protein of the present 
invention (i«e,, a parasite astacin metalloendopeptidase 
5 protein and/or a filariid cysteine protease protein) to be 
secreted from the cell that produces the protein and/or (b) 
contain fusion sequences which lead to the expression of 
inserted nucleic acid molecules of the present invention as 
fusion proteins* Eukaryotic recombinant molecules may include 

10 intervening and/or untranslated sequences surrounding and/or 
within the nucleic acid sequences of nucleic acid molecules of 
the present invention. Examples of suitable fusion segments 
encoded by fusion segment nucleic acids have been disclosed. 
Suitable signal segments include natural signal segments 

15 (e.g.^ a parasite astacin metalloendopeptidase or cysteine 
protease signal segment) or any heterologous signal segment 
capable of directing the secretion of a protein of the present 
invention. Preferred signal segments include, but are not 
limited to, tissue plasminogen activator (t-PA) , interferon, 

20 interleukin, growth hormone, histocompatibility and viral 
envelope glycoprotein signal segments. 

Nucleic acid molecules of the present invention can be 
operatively linked to expression vectors containing regulatory 
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sequences such as promoters, operators, repressors, enhancers, 
termination sequences, origins of replication, and other 
regulatory sequences that are compatible with the recombinant 
cell and that control the expression of nucleic acid molecules 
5 of the present invention. In particular, recombinant 
molecules of the present invention include transcription 
control sequences. Transcription control sequences are 
sequences which control the initiation, elongation, and 
termination of transcription. Particularly important 

10 transcription control sequences are those which control 
transcription initiation, such as promoter, enhancer, operator 
and repressor sequences. Suitable transcription control 
sequences include any transcription control sequence that can 
function in at least one of the recombinant cells of the 

15 present invention. A variety of such transcription control 
sequences are known to those skilled in the art. Preferred 
transcription control sequences include those which function 
in bacterial, yeast, helminth or other parasite, insect and 
mammalian cells, such as, but not limited to, tac, iac, trp, 

20 trc, oxy-pro, omp/lpp, rrnB, bacteriophage lambda (A) (such as 
Apl and Xp^^ and fusions that include such promoters) , 
bacteriophage T7, T7iac, bacteriophage T3, bacteriophage SP6, 
bacteriophage SPOl, metallothionein, alpha mating factor. 
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Pichia alcohol oxidase, alphavirus subgenomic promoters (such 
as Sindbis virus subgenomic promoters) , baculovirus, Heliothis 
zea insect virus, vaccinia virus, herpesvirus, poxvirus, 
adenovirus, simian virus 40, retrovirus actin, retroviral long 
5 terminal repeat, Rous sarcoma virus, heat shock, phosphate and 
nitrate transcription control sequences as well as other 
sequences capable of controlling gene expression in 
prokaryotic or eukaryotic cells. Additional suitable 
transcription control sequences include tissue-specific 

10 promoters and enhancers as well as lymphokine-inducible 
promoters (e , g. , promoters inducible by interferons or 
interleukins) . Transcription control sequences of the present 
invention can also include naturally occurring transcription 
control sequences naturally associated with a parasitic 

15 helminth, such as a D. immitiSf molecule prior to isolation. 

A recombinant molecule of the present invention is a 
molecule that can include at least one of any nucleic acid 
molecule heretofore described operatively linked to at least 
one of any transcription control sequence capable of 

2 0 effectively regulating expression of the nucleic acid 
molecule (s) in the cell to be transformed, examples of which 
are disclosed herein. Particularly preferred recombinant 
molecules include p3gal-nDiMPAl;L299/ p3gal-nDiMPA22i26A 
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ptrcHis-nDiMPA2804/ pAPRHis-nDiMPA2804/ pBBIII-nDiMPA22i26/ 
p(3gal-L3-nDiMPA32292A p3gal-L3-nDiMPA32076/ pPgal-adult- 
nDiMPA32032/ ppgal-adult-nDiMPA32028 ptrcHis-nDiCPi42 . 

Details regarding the production of such recombinant molecules 
5 is disclosed herein. 

A recombinant cell of the present invention includes any 
cell transformed with at least one of any nucleic acid 
molecule of the present invention, A preferred recombinant 
cell is a cell transformed with a nucleic acid molecule that 

10 includes at least a portion of a parasite astacin 
metalloendopeptidase nucleic acid molecule, such as nDiMPAlggg, 
nDiMPAl;L299A nDiMPA22i26/ nDiMPA2804A nDiMPA227i, L3 nDiMPA32292/ L3 
nDiMPA32076/ adult nDiMPA32032/ adult nDiMPA32028/ or BvMPA2 and/or 
at least a portion of a filariid cysteine protease nucleic 

15 acid molecule, such as a nucleic acid molecule including 
nDiCPi43 or nDiCP-L42 • Particularly preferred recombinant cells 
include E. coli :p3gal-nDiMPAl ^299/ coii : p3gal-nDiMPA22i26/ 
coii :ptrcHis-nDiMPA2804A ^. coli : pAPRHis-nDiMPA2804^ S. 
-frugiperda :pBBIII-nDiMPA22i26A coll ;p3gal-L3-nDiMPA32292A ^• 

20 coii :p3gal-L3-nDiMPA32076/ coii : ppgal-adult"nDiMPA32032' ^• 

coJi ;p3gal-adult-nDiMPA32028/ coii : ptrcHis-nDiCPi42 • 

Details regarding the production of these recombinant cells is 
disclosed herein. 
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Recombinant cells of the present invention can also be 
co-transformed with one or more recombinant molecules 
including nucleic acid molecules encoding one or more proteins 
of the present invention (e.g», parasite astacin 
5 metalloendopeptidase proteins and/or filariid cysteine 
protease proteins) and one or more other proteins useful in 
the production of multivalent vaccines which can include one 
or more protective compounds. 

It may be appreciated by one skilled in the art that use 

10 of recombinant DNA technologies can improve expression of 
transformed nucleic acid molecules by manipulating, for 
example, the number of copies of the nucleic acid molecules 
within a host cell, the efficiency with which those nucleic 
acid molecules are transcribed, the efficiency with which the 

15 resultant transcripts are translated, and the efficiency of 
post-translational modifications * Recombinant techniques 
useful for increasing the expression of nucleic acid molecules 
of the present invention include, but are not limited to, 
operatively linking nucleic acid molecules to high-copy number 

20 plasmids, integration of the nucleic acid molecules into one 
or more host cell chromosomes, addition of vector stability 
sequences to plasmids, substitutions or modifications of 
transcription control signals (e.g., promoters, operators. 
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enhancers) , substitutions or modifications of translational 
control signals (e.g., ribosome binding sites, Shine-Dalgarno 
sequences) , modification of nucleic acid molecules of the 
present invention to correspond to the codon usage of the host 
5 cell, deletion of sequences that destabilize transcripts, and 
use of control signals that temporally separate recombinant 
cell growth from recombinant enzyme production during 
fermentation. The activity of an expressed recombinant 
protein of the present invention may be improved by 
^ 10 fragmenting, modifying, or derivatizing nucleic acid molecules 
y encoding such a protein. 

In accordance with the present invention, recombinant 
^ cells of the present invention can be used to produce one or 

1^ more proteins of the present invention by culturing such cells 

^ 15 under conditions effective to produce such a protein, and 
^ recovering the protein. Effective conditions to produce a 

^ protein include, but are not limited to, appropriate media, 

bioreactor, temperature, pH and oxygen conditions that permit 
protein production. An appropriate medium refers to any 
20 medium in which a cell of the present invention, when 
cultured, is capable of producing a parasite protein, 
including a D. immitis protein, of the present invention. An 
effective medium is typically an aqueous medium comprising 
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assimilable carbohydrate, nitrogen and phosphate sources, as 
well as appropriate salts, minerals, metals and other 
nutrients, such as vitamins. The medium may comprise complex 
nutrients or may be a defined minimal medium. Cells of the 
present invention can be cultured in conventional fermentation 
bioreactors, which include, but are not limited to, batch, 
fed-batch, cell recycle, and continuous fermentors. Culturing 
can also be conducted in shake flasks, test tubes, microtiter 
dishes, and petri plates. Culturing is carried out at a 
temperature, pH and oxygen content appropriate for the 
recombinant cell. Such culturing conditions are well within 
the expertise of one of ordinary skill in the art. Examples 
of suitable conditions are included in the Examples section. 

Depending on the vector and host system used for 
production, resultant proteins of the present invention may 
either remain within the recombinant cell; be secreted into 
the fermentation medium; be secreted into a space between two 
cellular membranes, such as the periplasmic space in E. coll; 
or be retained on the outer surface of a cell or viral 
membrane. The phrase "recovering the protein" refers simply 
to collecting the whole fermentation medium containing the 
protein and need not imply additional steps of separation or 
purification. Proteins of the present invention can be 
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purified using a variety of standard protein purification 
techniques, such as, but not limited to, affinity 
chromatography, ion exchange chromatography, filtration, 
electrophoresis, hydrophobic interaction chromatography, gel 
5 filtration chromatography, reverse phase chromatography, 
concanavalin A chromatography, chromatof ocusing and 
differential solubilization. Proteins of the present 

invention are preferably retrieved in "substantially pure" 
form. As used herein, "substantially pure" refers to a purity 
O 10 that allows for the effective use of the protein as a 
y therapeutic composition or diagnostic. A vaccine for animals, 

^•f for example, should exhibit no substantial toxicity and should 

"^j be capable of stimulating the production of antibodies in a 

Z_ vaccinated animal . 

15 The present invention also includes isolated antibodies 

^ capable of selectively binding to a protein of the present 

^ invention or to a mimetope thereof • Antibodies capable of 

selectively binding to a parasite astacin metalloendopeptidase 
protein of the present invention are referred to as anti- 
20 parasite astacin metalloendopeptidase antibodies. A 
particularly preferred antibody of this embodiment is an anti- 
D. immitis astacin metalloendopeptidase antibody. Antibodies 
capable of selectively binding to a filariid cysteine protease 
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protein of the present invention are referred to as anti- 
filariid cysteine protease antibodies. A particularly 
preferred antibody of this embodiment is an anti-D. immitis 
cysteine protease antibody. Isolated antibodies are 

antibodies that have been removed from their natural milieu. 
The term "isolated" does not refer to the state of purity of 
such antibodies. As such, isolated antibodies can include 
anti-sera containing such antibodies, or antibodies that have 
been purified to varying degrees. As used herein, the term 
"selectively binds to" refers to the ability of such 
antibodies to preferentially bind to specified proteins and 
mimetopes thereof of the present invention. Binding can be 
measured using a variety of methods known to those skilled in 
the art including immunoblot assays, immunoprecipitation 
assays, radioimmunoassays, enzyme immunoassays (e.g., ELISA) , 
immunof Increscent antibody assays and immunoelectron 
microscopy; see, for example, Sambrook et al., ibid. 

Antibodies of the present invention can be either 
polyclonal or monoclonal antibodies . Antibodies of the 
present invention include functional equivalents such as 
antibody fragments and genetically-engineered antibodies, 
including single chain antibodies, that are capable of 
selectively binding to at least one of the epitopes of the 
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protein or raimetope used to obtain the antibodies. Preferred 
antibodies are raised in response to proteins, or mimetopes 
thereof, that are encoded, at least in part, by a nucleic acid 
molecule of the present invention • 
5 A preferred method to produce antibodies of the present 

invention includes (a) administering to an animal an effective 
amount of a protein or mimetope thereof of the present 
invention to produce the antibodies and (b) recovering the 
antibodies. Antibodies raised against defined proteins or 
tj 10 mimetopes can be advantageous because such antibodies are not 
y substantially contaminated with antibodies against other 

substances that might otherwise cause interference in a 
diagnostic assay or side effects if used in a therapeutic 
composition. 

^ 15 Antibodies of the present invention have a variety of 

H: potential uses that are within the scope of the present 

^ invention. For example, such antibodies can be used (a) as 

vaccines to passively immunize an animal in order to protect 
the animal from parasites susceptible to treatment by such 
20 antibodies, (b) as reagents in assays to detect infection by 
such parasites and/or (c) as tools to recover desired proteins 
of the present invention from a mixture of proteins and other 
contaminants . 
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Furthermore, antibodies of the present invention can be 
used to target cytotoxic agents to parasites of the present 
invention in order to directly kill such parasites. Targeting 
can be accomplished by conjugating (i.e., stably joining) such 
antibodies to the cytotoxic agents using techniques known to 
those skilled in the art. Suitable cytotoxic agents include, 
but are not limited to: double-chain toxins (i.e., toxins 
having A and B chains), such as diphtheria toxin, ricin toxin, 
Pseudomonas exotoxin, modeccin toxin, abrin toxin, and shiga 
toxin; single-chain toxins, such as pokeweed antiviral 
protein, a-amanitin, and ribosome inhibiting proteins; and 
chemical toxins, such as melphalan, methotrexate, nitrogen 
mustard, doxorubicin and daunomycin. Preferred double-chain 
toxins are modified to include the toxic domain and 
translocation domain of the toxin but lack the toxin's 
intrinsic cell binding domain. 

One embodiment of the present invention is a therapeutic 
composition that, when administered to an animal in an 
effective manner, is capable of protecting that animal from 
disease caused by a parasite that is susceptible to at least 
one of the following treatments: immunization with an 
isolated parasite astacin metalloendopeptidase of the present 
invention, immunization with an isolated filariid cysteine 
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protease of the present invention, administration of an 
inhibitor of astacin metalloendopeptidase activity or 
administration of an inhibitor of cysteine protease activity. 
As used herein, a parasite that is susceptible to such a 
5 treatment is a parasite that, if such treatment is 
administered to an animal in an effective manner, shows 
substantially reduced ability to cause disease in the animal. 
It is to be understood that such parasite can be susceptible 
to treatments other than just those listed immediately above. 
10 Such treatments can include, but are not limited to, 
additional treatments, or therapeutic compositions, disclosed 
herein. 

Therapeutic compositions of the present invention include 
at least one of the following protective compounds: (a) an 

15 isolated parasite astacin metalloendopeptidase protein or a 
mimetope thereof; (b) an isolated parasite nucleic acid 
molecule capable of hybridizing under stringent conditions 
with a D. immitis astacin metalloendopeptidase gene; (c) an 
anti-parasite astacin metalloendopeptidase antibody; (d) an 

20 inhibitor of astacin metalloendopeptidase activity identified 
by its ability to inhibit parasite astacin 
metalloendopeptidase activity; (e) an isolated filariid 
nematode cysteine protease protein or a mimetope thereof; (f) 
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an isolated filariid nematode nucleic acid molecule capable of 
hybridizing under stringent conditions with a D. immitis 
cysteine protease gene; (g) an anti-f ilariid nematode cysteine 
protease antibody; and (h) an inhibitor of cysteine protease 
5 activity identified by its ability to inhibit filariid 
nematode cysteine protease activity. As used herein, a 
protective compound refers to a compound that, when 
administered to an animal in an effective manner, is able to 
treat, ameliorate, and/or prevent disease caused by a parasite 

10 of the present invention. Preferred parasites to target are 
heretofore disclosed. Examples of proteins, nucleic acid 
molecules and antibodies of the present invention are 
disclosed herein. Astacin metalloendopeptidase inhibitors and 
cysteine protease inhibitors of the present invention are 

15 described in more detail below. 

The present invention also includes a therapeutic 
composition comprising at least one astacin parasite 
metalloendopeptidase-based or filariid nematode cysteine 
protease-based protective compound of the present invention in 

2 0 combination with at least one additional compound protective 
against one or more infectious agents. Examples of such 
compounds and infectious agents are heretofore disclosed. 
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Therapeutic compositions of the present invention can be 
administered to any animal susceptible to such therapy, 
preferably to mammals, and more preferably to dogs, cats, 
humans, ferrets, horses, cattle, sheep and other pets and/or 
5 economic food animals. Preferred animals to protect include 
dogs, cats, humans and ferrets, with dogs and cats being 
particularly preferred. 

In one embodiment, a therapeutic composition of the 
present invention can be administered to the vector in which 

10 the parasite develops from a microfilaria into L3, such as to 
a mosquito in order to prevent the spread of heartworm. Such 
administration could be orally or by developing transgenic 
vectors capable of producing at least one therapeutic 
composition of the present invention. In another embodiment, 

15 a vector, such as a mosquito, can ingest therapeutic 
compositions present in the blood of a host that has been 
administered a therapeutic composition of the present 
invention. 

Therapeutic compositions of the present invention can be 
20 formulated in an excipient that the animal to be treated can 
tolerate . Examples of such excipient s include water, 

saline, Ringer's solution, dextrose solution. Hank's solution, 
and other aqueous physiologically balanced salt solutions. 
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Nonaqueous vehicles, such as fixed oils, sesame oil, ethyl 
oleate, or triglycerides may also be used. Other useful 
formulations include suspensions containing viscosity 
enhancing agents, such as sodium carboxymethylcellulose, 
sorbitol, or dextran, Excipients can also contain minor 
amounts of additives, such as substances that enhance 
isotonicity and chemical stability. Examples of buffers 
include phosphate buffer, bicarbonate buffer and Tris buffer, 
while examples of preservatives include thimerosal, m- or o- 
cresol, formalin and benzyl alcohol. Standard formulations 
can either be liquid injectables or solids which can be taken 
up in a suitable liquid as a suspension or solution for 
injection. Thus, in a non-liquid formulation, the excipient 
can comprise dextrose, human serum albumin, preservatives, 
etc., to which sterile water or saline can be added prior to 
administration. 

In one embodiment of the present invention, the 
therapeutic composition can also include an immunopotentiator, 
such as an adjuvant or a carrier. Adjuvants are typically 
substances that generally enhance the immune response of an 
animal to a specific antigen. Suitable adjuvants include, but 
are not limited to, Freund's adjuvant; other bacterial cell 
wall components; aluminum-based salts; calcium-based salts; 
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silica; polynucleotides; toxoids; serum proteins; viral coat 
proteins; other bacterial-derived preparations; gamma 
interferon; block copolymer adjuvantS;^ such as Hunter's 
Titermax adjuvant (Vaxcel^yr Inc, Norcross, GA) ; Ribi adjuvants 
5 (available from Ribi ImmunoChem Research, Inc., Hamilton, MT) ; 
and saponins and their derivatives, such as Quil A (available 
from Superfos Biosector A/S, Denmark) . Carriers are typically 
compounds that increase the half-life of a therapeutic 
composition in the treated animal. Suitable carriers include, 

10 but are not limited to, polymeric controlled release 
formulations, biodegradable implants, liposomes, bacteria, 
viruses, oils, esters, and glycols. 

In order to protect an animal from disease caused by a 
parasite of the present invention, a therapeutic composition 

15 of the present invention is administered to the animal in an 
effective manner such that the composition is capable of 
protecting that animal from a disease caused by a parasite. 
For example, an isolated protein or mimetope thereof, when 
administered to an animal in an effective manner, is able to 

20 elicit (i.e., stimulate) an immune response, preferably 
including both a humoral and cellular response, that is 
sufficient to protect the animal from the disease. Similarly, 
an antibody of the present invention, when administered to an 
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animal in an effective manner, is administered in an amount so 
as to be present in the animal at a titer that is sufficient 
to protect the animal from the disease, at least temporarily. 
Oligonucleotide nucleic acid molecules of the present 
5 invention can also be administered in an effective manner, 
thereby reducing expression of astacin metalloendopeptidase or 
cysteine protease proteins in order to interfere with 
development of parasites targeted in accordance with the 
present invention • 

10 Therapeutic compositions of the present invention can be 

administered to animals prior to infection in order to prevent 
infection and/or can be administered to animals after 
infection in order to treat disease caused by the parasite. 
For example, proteins, mimetopes thereof, and antibodies 

15 thereof can be used as immunotherapeutic agents. 

Acceptable protocols to administer therapeutic compositions in 
an effective manner include individual dose size, number of 
doses , frequency of dose administration, and mode of 
administration. Determination of such protocols can be 

20 accomplished by those skilled in the art. A suitable single 
dose is a dose that is capable of protecting an animal from 
disease when administered one or more times over a suitable 
time period. For example, a preferred single dose of a 



78 



2618-21-1-C1 

protein, mimetope or antibody therapeutic composition is from 
about 1 microgram (ijg) to about 10 milligrams (mg) of the 
therapeutic composition per kilogram body weight of the 
animal. Booster vaccinations can be administered from about 
5 2 weeks to several years after the original administration. 
Booster vaccinations preferably are administered when the 
immune response of the animal becomes insufficient to protect 
the animal from disease. A preferred administration schedule 
is one in which from about 10 lag to about 1 mg of the vaccine 

10 per kg body weight of the animal is administered from about 
one to about two times over a time period of from about 2 
weeks to about 12 months. Modes of administration can 
include, but are not limited to, subcutaneous, intradermal, 
intravenous, nasal, oral, transdermal and intramuscular 

15 routes. 

According to one embodiment, a nucleic acid molecule of 
the present invention can be administered to an animal in a 
fashion to enable expression of that nucleic acid molecule 
into a protective protein or protective RNA (e.g., antisense 
20 RNA, ribozyme or RNA drug) in the animal to be protected from 
disease. Nucleic acid molecules can be delivered to an animal 
in a variety of methods including, but not limited to, 
(a) direct injection (e.g., as "naked" DNA or RNA molecules. 
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such as is taught, for example in Wolff et al., 1990, Science 
241, 1465-1468) or (b) packaged as a recombinant virus 
particle vaccine or as a recombinant cell vaccine (i,e,, 
delivered to a cell by a vehicle selected from the group 
5 consisting of a recombinant virus particle vaccine and a 
recombinant cell vaccine) . 

A recombinant virus particle vaccine of the present 
invention includes a recombinant molecule of the present 
invention that is packaged in a viral coat and that can be 

10 expressed in an animal after administration. Preferably, the 
recombinant molecule is packaging-deficient, A number of 
recombinant virus particles can be used, including, but not 
limited to, those based on alphaviruses, poxviruses, 
adenoviruses, herpesviruses, and retroviruses. Preferred 

15 recombinant particle viruses are those based on alphaviruses 
(such as Sindbis virus), herpesviruses and poxviruses. 
Methods to produce and use recombinant virus particle vaccines 
are disclosed in U.S . Patent Application Serial No . 
0 8/015/414, filed February 8, 1993, entitled "Recombinant 

20 Virus Particle Vaccines", which is incorporated by reference 
herein in its entirety. 

When administered to an animal, a recombinant virus 
particle vaccine of the present invention infects cells within 
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the immunized animal and directs the production of a 
protective protein or RNA nucleic acid molecule that is 
capable of protecting the animal from disease caused by a 
parasite of the present invention. For example, a recombinant 
5 virus particle comprising a D. immitis astacin 
metalloendopeptidase nucleic acid molecule and/or a D. immitis 
cysteine protease nucleic acid molecule of the present 
invention is administered according to a protocol that results 
in the animal producing a sufficient immune response to 

10 protect itself from heartworm. A preferred single dose of a 
recombinant virus particle vaccine of the present invention is 
from about 1 x 10^ to about 1 x 10'^ virus plaque forming units 
(pfu) per kilogram body weight of the animal. Administration 
protocols are similar to those described herein for protein- 

15 based vaccines, 

A recombinant cell vaccine of the present invention 
includes recombinant cells of the present invention that 
express at least one protein of the present invention. 
Preferred recombinant cells include Salmonellar E. coli, 

2 0 Mycobacterium, S. frugiperda, baby hamster kidney, myoblast 
G8, COS, MDCK and CRFK recombinant cells, with Salmonella 
recombinant cells being more preferred. Such recombinant 
cells can be administered in a variety of ways but have the 
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advantage that they can be administered orally, preferably at 
doses ranging from about 10^ to about 10 ^tacteria per 
kilogram body weight. Administration protocols are similar to 
those described herein for protein-based vaccines . 
Recombinant cell vaccines can comprise whole cells or cell 
lysates . 

In common with most other enteric pathogens. Salmonella 
strains normally enter the host orally. Once in the 
intestine, they interact with the mucosal surface, normally to 
establish an invasive infection. Most Salmonella infections 
are controlled at the epithelial surface, causing the typical 
Saimoneii a- induced gastroenteritis . Some strains of 

Salmonellar including 5. typhi and some 5. typhimurium 
isolates, have evolved the ability to penetrate deeper into 
the host, causing a disseminated systemic infection. It 
appears such strains have the capacity to resist the killing 
actions of macrophages and other immune cells. 5. typhi can 
exist for long periods as a facultative intracellular 
parasite. Some of the live vaccine strains can also persist 
for long periods in the mononuclear phagocyte system. Hosts 
infected in such a manner develop, in addition to a mucosal 
immune response, systemic cellular and serum antibody 
responses to the Salmonella. Thus, invading Salmonellar 
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whether virulent or attenuated, can stimulate strong immune 
responses, unlike many other enteric pathogens which only set 
up local, noninvasive gut infections . The potent 

immunogenicity of live Salmonella makes them attractive 
5 candidates for carrying nucleic acid molecules of the present 
invention, and the proteins they encode, to the immune system, 
A preferred recombinant cell-based vaccine is one in 
which the cell is attenuated. Salmonella typhimurium strains, 
for example, can be attenuated by introducing mutations into 

10 genes critical for in vivo growth and survival. For example, 
genes encoding cyclic adenosine monophosphate (cAMP) receptor 
protein or adenylate cyclase are deleted to produce avirulent, 
vaccine strains . Such strains can deliver antigens to 
lymphoid tissue in the gut but demonstrate reduced capacity to 

15 invade the spleen and mesenteric lymph nodes. These strains 
are still capable of stimulating both humoral and cellular 
immunity in mammalian hosts. 

Recombinant cell vaccines can be used to introduce 
proteins of the present invention into the immune systems of 

20 animals. For example, recombinant molecules comprising 
nucleic acid molecules of the present invention operatively 
linked to expression vectors that function in Salmonella can 
be transformed into Salmonella host cells. The resultant 
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recombinant cells are then introduced into the animal to be 
protected. Preferred Salmonella host cells are those for 
which survival depends on their ability to maintain the 
recombinant molecule (i.e., a balanced-lethal host-vector 
5 system) . An example of such a preferred host recombinant 
molecule combination is a Salmonella strain (e.g., UK-1 ^3987 
or SR-11 ^4 072 ) which is unable to produce aspartate (3- 
semialdehyde dehydrogenase in combination with a recombinant 
molecule also capable of encoding the enzyme. Aspartate 3- 

10 semialdehyde dehydrogenase, encoded by the asd gene, is an 
important enzyme in the pathway to produce diaminopimelic acid 
(DAP) . DAP is an essential component of the peptidoglycan of 
the cell wall of Gram-negative bacteria, such as Salmonella, 
and, as such, is necessary for survival of the cell. Thus, 

15 Salmonella lacking a functional asd gene can only survive if 
they maintain a recombinant molecule that is also capable of 
expressing a functional asd gene. 

In one embodiment, a nucleic acid molecule of the present 
invention is inserted into expression vector pTECH-1 

20 (available from Medeva, London, U.K.) and the resulting 
recombinant molecule is transfected into a Salmonella strain, 
such as BRD 509 (available from Medeva), to form a recombinant 
cell . Such recombinant cells can be used to produce the 
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corresponding encoded protein or can be used as recombinant 
cell vaccines. 

One preferred embodiment of the present invention is the 
use of nucleic acid molecules and proteins of the present 
5 invention, and particularly D. immitis nucleic acid molecules 
and proteins of the present invention;, to protect an animal 
from heartworm. Preferred therapeutic compositions are those 
that are able to inhibit at least one step in the portion of 
the parasite's development cycle that includes L3 larvae, 

10 third molt, L4 larvae, fourth molt and immature adult prior to 
entering the circulatory system. In dogs, this portion of the 
development cycle is about 70 days. As such, preferred 
therapeutic compositions include D. immitis astacin 
metalloendopeptidase-based and D. immitis cysteine protease- 

15 based therapeutic compounds of the present invention. Such 
compositions are administered to animals in a manner effective 
to protect the animals from heartworm. Additional protection 
may be obtained by administering additional protective 
compounds, including other D, immitis proteins, nucleic acid 

20 molecules and antibodies as heretofore disclosed. 

One embodiment of the present invention is the use of 
enzymatically active parasite astacin metalloendopeptidase 
and/or filariid nematode cysteine protease proteins of the 
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present invention to identify inhibitors of such enzyme 
activity. While not being bound by theory, it is believed 
that parasites use such proteases in a number of ways^^ 
including, but not limited to, to effect embryonic and larval 
5 development, to effect molting and to effect tissue migration 
both as larvae and adults. Such proteases are capable of 
degrading cutaneous connective tissue macromolecules as well 
as other proteinaceous material to facilitate such functions. 
It is also of interest that astacin metalloendopeptidases 

10 identified in sea urchins, Drosophila and Xenopus have been 
linked with development and maturation of the respectively 
organisms. As such, inhibitors of astacin 

metalloendopeptidase and/or cysteine protease activity could 
be particularly beneficial in disrupting embryonic and/or 

15 larval development or molting by parasites in general and 
tissue migration by those parasites capable of such migration. 
Use of parasite enzymes to develop such inhibitors is also 
advantageous because inhibitors can be identified that are 
highly selective for the parasite without causing undue side 

2 0 effects to the animal being treated . 

One therapeutic composition of the present invention 
includes an inhibitor of parasite astacin metalloendopeptidase 
activity, i.e., a compound capable of substantially 
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interfering with the function of a parasite astacin 
metalloendopeptidase susceptible to inhibition by an inhibitor 
of D. imml tls astacin metalloendopeptidase activity • An 
inhibitor of astacin metalloendopeptidase can be identified 
5 using enzymatically active parasite and preferably D. immitiSf 
astacin metalloendopeptidase proteins of the present 
invention • 

One embodiment of the present invention is a method to 
identify a compound capable of inhibiting astacin 

10 metalloendopeptidase activity of a parasite. Such a method 
includes the steps of (a) contacting (e.g., combining, mixing) 
an isolated parasite, preferably D. immitiSf astacin 
metalloendopeptidase protein with a putative inhibitory 
compound under conditions in which, in the absence of the 

15 compound, the protein has astacin metalloendopeptidase 
activity, and (b) determining if the putative inhibitory 
compound inhibits the astacin metalloendopeptidase activity. 
Putative inhibitory compounds to screen include organic 
molecules, antibodies (including functional equivalents 

20 thereof) and substrate analogs. Methods to determine astacin 
metalloendopeptidase activity are known to those skilled in 
the art; see, for example, Gomis-Ruth, et al . ibid. , and 
referenced cited therein. 
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The present invention also includes a test kit to 
identify a compound capable of inhibiting astacin 
metalloendopeptidase activity of a parasite. Such a test kit 
includes an isolated parasite^ preferably D. immitis, astacin 
metalloendopeptidase protein having astacin 

metalloendopeptidase activity and a means for determining the 
extent of inhibition of astacin metalloendopeptidase activity 
in the presence of (i.e., effected by) a putative inhibitory 
compound. 

Astacin metalloendopeptidase inhibitors isolated by such 
a method, and/or test kit, can be used to inhibit any astacin 
metalloendopeptidase that is susceptible to such an inhibitor. 
Preferred astacin metalloendopeptidase enzymes to inhibit are 
those produced by parasites. A particularly preferred astacin 
metalloendopeptidase inhibitor of the present invention is 
capable of protecting an animal from heartworm. It is also 
within the scope of the present invention to use inhibitors of 
the present invention to target astacin metalloendopeptidase- 
related disorders in animals. Therapeutic compositions 
comprising astacin metalloendopeptidase inhibitory compounds 
of the present invention can be administered to animals in an 
effective manner to protect animals from disease caused by the 
targeted astacin metalloendopeptidase enzymes, and preferably 
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to protect animals from heartworm. Effective amounts and 
dosing regimens can be determined using techniques known to 
those skilled in the art. 

Another therapeutic composition of the present invention 
5 includes an inhibitor of parasite cysteine protease activity, 
i.e., a compound capable of substantially interfering with the 
function of a parasite cysteine protease susceptible to 
inhibition by an inhibitor of filariid nematode cysteine 
protease activity, A cysteine protease inhibitor can be 
10 identified using enzymatically active filariid nematode, and 
"2 preferably D. immitis, cysteine protease proteins of the 

fi present invention. 

2 One embodiment of the present invention is a method to 

1,^ identify a compound capable of inhibiting cysteine protease 

%^ 15 activity of a parasite. Such a method includes the steps of 
(a) contacting (e.g., combining, mixing) an isolated filariid 
^ nematode, preferably D. ixtmitis, cysteine protease protein 

with a putative inhibitory compound under conditions in which, 
in the absence of the compound, the protein has cysteine 
20 protease activity, and (b) determining if the putative 
inhibitory compound inhibits the cysteine protease activity. 
Putative inhibitory compounds to screen include organic 
molecules, antibodies (including functional equivalents 
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thereof) and substrate analogs. Methods to determine cysteine 
protease activity are known to those skilled in the art, as 
heretofore disclosed. 

The present invention also includes a test kit to 
5 identify a compound capable of inhibiting cysteine protease 
activity of a parasite. Such a test kit includes an isolated 
filariid nematode, preferably D, immitis, cysteine protease 
protein having cysteine protease activity and a means for 
determining the extent of inhibition of cysteine protease 

10 activity in the presence of (i.e,, effected by) a putative 
inhibitory compound. 

Inhibitors isolated by such a method, and/or test kit, 
can be used to inhibit any cysteine protease that is 
susceptible to such an inhibitor. Preferred cysteine protease 

15 enzymes to inhibit are those produced by parasites, A 
particularly preferred cysteine protease inhibitor of the 
present invention is capable of protecting an animal from 
heartworm. It is also within the scope of the present 
invention to use inhibitors of the present invention to target 

20 cysteine protease-related disorders in animals. Therapeutic 
compositions comprising cysteine protease-inhibitory compounds 
of the present invention can be administered to animals in an 
effective manner to protect animals from disease caused by the 
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targeted cysteine protease. Effective amounts and dosing 
regimens can be determined using techniques known to those 
skilled in the art. 

The efficacy of a therapeutic composition of the present 
5 invention to protect an animal from disease caused by a 
parasite can be tested in a variety of ways including, but not 
limited to, detection of protective antibodies (using, for 
example, proteins or mimetopes of the present invention) , 
detection of cellular immunity within the treated animal, or 
y 10 challenge of the treated animal with the parasite to determine 
whether the treated animal is resistant to disease. Such 
techniques are known to those skilled in the art. 
2f In accordance with the present invention, the inventors 

have shown that protease inhibitors can inhibit parasite 
15 larval development. For example, bestatin and phosphoramidon 
H: have been shown to inhibit molting of D. immitis larvae, as 

^ described in more detail in the Examples. 

Another embodiment of the present invention includes the 
isolation of proteases, including metalloproteases and 
20 cysteine proteases from parasitic larval excretory/secretory 
(ES) products. Using a modified version of the protocol first 
described in U.S. Patent Application Serial No. 08/153,554, 
ibid., the inventors have, for example, isolated a fraction 
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comprising a protein of approximately 60 kD (as determined by 
Tris-glycine sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE) that has metalloprotease activity 
as determined by the protein's ability to cleave H-Phe-AMC, 
5 When submitted to size-exclusion chromatography, the active 
fraction elutes with bovine serum albumin, indicating an 
approximate molecular weight of from about 62 to about 66 kD. 
The modified protocol, which is described in more detail in 
the Examples, includes submitting parasitic, preferably D, 

10 iimnitiSf larval ES, to anion exchange chromatography, followed 
by size exclusion chromatography and isoelectric focussing. 
The active fraction has a pi of about 6.8. 

In another embodiment, the inventors have identified at 
least one parasite metalloprotease from the ES that is capable 

15 of degrading collagen. For example, electrophoresis of D. 
immitls larval ES through a gelatin-based matrix leads to 
isolated active fractions that migrate with apparent molecular 
weights of about 60, 95 and at least about 200 kD. The 
proteolytic activity of such fractions is essentially 

2 0 completely inhibited by EDTA. 

It is also within the scope of the present invention to 
use isolated proteins, mimetopes, nucleic acid molecules and 
antibodies of the present invention as diagnostic reagents to 
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detect infection by parasites. Such diagnostic reagents can 
be supplemented with additional compounds that can detect 
other phases of the parasite's life cycle. 

The following examples are provided for the purposes of 
illustration and are not intended to limit the scope of the 
present invention . 

Examples 

The following examples include a number of recombinant 
DNA and protein chemistry techniques which are known to those 
skilled in the art; see, for example, Sambrook et al • , ibid. 

Example 1 

This Example discloses the cloning and sequencing of two 
parasite astacin metalloendopeptidase nucleic acid molecules 
of the present invention. This Example also discloses the 
production of a recombinant molecule and recombinant cell of 
the present invention. 

A D. immitis third stage larvae cDNA expression library 
was prepared in the following manner. Total RNA was extracted 
from D. immitis third stage larvae (L3) using an acid- 
guanidinium-phenol-chloroform method similar to that described 
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by Chomczynski and Sacchi, 1987, Anal. Biochem. 162, p, 
156-159. Approximately 230, 000 L3 were used in the RNA 
preparation. Poly A+ selected RNA was separated from total 
RNA by oligo-dT cellulose chromatography using Oligo dT 
5 cellulose from Collaborative Research Inc., Waltham, MA, 
according to the method recommended by the manufacturer. 

The expression library was constructed by inserting the 
L3 poly A+ RNA into the expression vector lambda (A) Uni-ZAP'^ 
XR (available from Stratagene Cloning Systems, La Jolla, CA) 

10 using Stratagene 's ZAP-cDNA Synthesis Kitcg) protocol and about 
6-7 /^g of L3 poly A+ RNA. The resultant library was amplified 
to a titer of about 4.88 x 10^ pfu/ml with about 96% 
recombinants. Ten minilibraries were generated by one further 
round of amplification of randomly selected aliquots of the 

15 original L3 cDNA amplified library. These minilibrary phage 
were collected in phage dilution buffer (e.g., 10 mM Tris-HCl, 
pH 7.5, 10 mM magnesium sulfate) and stored at 4°C. 

A D. immitis astacin metalloendopeptidase nucleic acid 
molecule of about 689 nucleotides, representing a partial D. 

20 immitis astacin metalloendopeptidase gene and denoted 
nDiMPAlggg, was PCR amplified from D. immitis L3 cDNA 
expression minilibraries using the following two primers: a 4- 
fold degenerate primer having SEQ ID NO: 14, namely 5' 
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ACWCATGAAATIGSICAT 3' (denoted MPl; W can be A or T; S can be 
C or G; I is inosine) and an antisense oligonucleotide having 
SEQ ID NO: 15, namely 5 ' AATACGACTCACTATAG 3' (denoted T7). 
Primer MPl was designed from published sequences of the 
5 metalloprotease conserved zinc binding domain. Primer T7 is 
complementary to the pBluescript® vector (available from 
Stratagene) . 

A nucleic acid molecule amplified from minilibrary number 
10, which was denoted nDlMPAlggg, was gel-purified, 

10 electroeluted and cloned into the cloning vector pCRII 
(available from Invitrogen, San Diego, CA) following 
manufacturer's instructions, thereby forming recombinant 
vector pCRII-nDiMPAlggg • The nucleotide sequence of nDlMPAlgsg 
(characterized by a coding strand having the nucleic acid 

15 sequence of SEQ ID NO: 20) was determined and found to include 
nucleotides spanning from about nucleotide position 610 
through the 3' end of SEQ ID N0:1, the production of which is 
described in more detail below. 

The L3 cDNA minilibrary number 10 was screened with the 

20 radiolabeled MPl oligonucleotide as a probe, using stringent 
(i , e . , standard) hybridization conditions as described in 
Sambrook et al., ibid. Plaques which hybridized to the probe 
were rescreened and plaque-purified. The plaque-purified clone 
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including D. immitis nucleic acid molecule nDiMPA1^299 ^^s 
converted into a double stranded recombinant molecule, herein 
denoted as p(3gal-nDiMPAli299A using R408 helper phage and 
XLl-Blue E. coll according to the in vivo excision protocol 
5 described in the Stratagene ZAP-cDNA Synthesis Kit®. Double- 
stranded plasmid DNA was prepared using an alkaline lysis 
protocol, such as that described in Sambrook et al., ibid. 
Recombinant molecule ppgal-nDiMPAl^sgg was transformed into E. 
coli to form recombinant cell E. coii :p|3gal-nDiMPA1^299 - 

10 Recombinant molecule p3gal-nDiMPAl-^299 was submitted to 

nucleic acid sequencing using the Sanger dideoxy chain 
termination method, as described in Sambrook et al,, ibid. An 
about 1299 nucleotide consensus sequence of nucleic acid 
molecule nDiMPAl:L299 was determined and is presented as SEQ ID 

15 N0:1, SEQ ID N0:1 apparently encodes three overlapping open 
reading frames. The first open reading frame, denoted 
PDiMPAloj^^i, is about 191 amino acids (presented in SEQ ID 
NO: 3) and encompasses about nucleotide number 18-590 of SEQ ID 
N0:1, The second open reading frame, denoted PDiMPAloRF2/ is 

20 about 141 amino acids (presented in SEQ ID NO: 4) and 
encompasses about nucleotide number 508-930 of SEQ ID:1, Open 
reading frame PDiMPAloRF2 includes the extended zinc binding 
domain and hydrophilic region HEIGHTLGIFHE beginning at about 
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amino acid 36 of SEQ ID NO: 4, as well as the domain YDTGSVMHY 
(beginning at about amino acid position 87) that includes the 
tyrosine that is thought to bind to zinc at about amino acid 
position 95. The third open reading frame^ denoted 
5 PDiMPAloRF3/ is about 121 amino acids (presented in SEQ ID NO: 
5) and encompasses nucleotide number 785-1147 of SEQ ID:1. 

A homology search of the non-redundant protein sequence 
database was performed through the National Center for 
Biotechnology Information using the BLAST network. This 

10 database includes +SwissProt + FIR + SPUpdate + GenPept + 
GPUpdate. The search, which was performed using SEQ ID NO: 3, 
SEQ ID NO: 4 and SEQ ID NO: 5, showed significant homology at 
the amino acid level by all three open reading frames to 
members of the astacin family of metalloendopeptidases . 

15 Significant homology throughout all three amino acid open 
reading frames is also associated with a C. elegans R151.5 
gene product, Genbank accession number U00036. 

It was apparent from the pBluescript vector sequences at 
and near the 5' end of nucleic acid molecule nDiMPAl-^299 that 

2 0 two cDNA fragments had ligated to each other via their 5' ends 
to form that nucleic acid molecule, A comparison of the 
nucleotide sequence of SEQ ID N0:1 to that of SEQ ID NO: 2 (the 
apparent nucleotide sequence of independently-isolated astacin 
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metal loendopeptidase nucleic acid molecule nDiMPA22i26 
described in Example 2) as well as to nucleotide sequences of 
other members the astacin family of metalloendopeptidases, 
showed that nucleotide number 56 of SEQ ID N0:1 corresponds to 
5 the first nucleotide on the 5' end of nDiMPA22i26 suggesting 
that nucleotide number 56 represents the junction of the two 
different cDNA sequences in nDiMPAli299 - Therefore, 
nucleotides 1 through 55 of nDiMPAli299 most likely represent 
a non-astacin related cDNA sequence which ligated to the 5' 
10 end of the astacin metalloendopeptidase cDNA fragment prior to 
cloning. Astacin homology at the amino acid level can be 
found starting with amino acid number 125 of the first open 
reading frame of nDiMPAl:^299^ namely PDIMPAIorfi* 



Example 2 

15 This Example discloses the cloning and sequencing of an 

additional parasite astacin metalloendopeptidase nucleic acid 
molecule of the present invention. This Example also 
discloses the production of a recombinant molecule and 
recombinant cell of the present invention, 

20 Due to the unusual sequences on the 5' end of astacin 

metalloendopeptidase nucleic acid molecule nDiMPAl-L299/ a 
second astacin metalloendopeptidase nucleic acid molecule. 



98 



2618~21-1-C1 

denoted nDiMPA22i26/ ^as isolated from the L3 cDNA expression 
library described in Example 1, as follows • 

A D. immitis astacin metalloendopeptidase nucleic acid 
molecule of about 271 nucleotides, denoted nDiMPA227i 
5 (characterized by a coding strand having the nucleic acid 
sequence of SEQ ID NO:22), was PGR amplified from D. immitis 
L3 cDNA expression minilibrary number 9 using the following 
two primers: an oligonucleotide having SEQ ID NO: 16, namely 5' 
TGGTATTATATCACATGAAATTGGTCATAC 3' (denoted ZNSEN) and an 

10 antisense oligonucleotide having SEQ ID NO: 17, namely 5' 
CCCAATTGTGTACTGTTGAAATTTATCAC 3' (denoted MP14) . Primer ZNSEN 
was designed from the nucleotide sequence encoding the 
metalloprotease conserved zinc binding domain in nDiMPAl-L299 
and spans from about nucleotide 600 through about nucleotide 

15 629 of SEQ ID N0:1, Primer MP14 is an antisense primer 
complementary to a region spanning from about nucleotide 8 42 
through about nucleotide 87 0 of nDiMPA1^299 » 

Nucleic acid molecule nDiMPA227a was radiolabeled and used 
as a probe to screen L3 cDNA minilibrary number 9. Plaques 

20 which hybridized under stringent hybridization conditions to 
the probe were rescreened and plaque purified. A 
plaque-purified clone including D. immitis nucleic acid 
molecule nDiMPA22i26/ converted into a double-stranded 
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recombinant molecule^ herein denoted as p3gal-nDiMPA22i26/ 
using R408 helper phage and XLl-Blue E. coli according to the 
in vivo excision protocol described in the Stratagene ZAP-cDNA 
Synthesis Kit®. Double-stranded plasmid DNA was prepared 
5 using an alkaline lysis protocol^ such as that described in 
Sambrook et al., ibid. Recombinant molecule p(3gal-nDiMPA22i26 
was transformed into E. coli to form recombinant cell E. 
coli : p3gal-nDiMPA22i26 • 

Recombinant molecule p3gal-nDiMPA22i26 was submitted to 

10 nucleic acid sequencing using the Sanger dideoxy chain 
termination method^ as described in Sambrook et al . , ibid. An 
about 2126-nucleotide consensus sequence of nucleic acid 
molecule nDiMPA22i26 was determined and is presented as SEQ ID 
NO: 2, SEQ ID NO: 2 apparently encodes five overlapping open 

15 reading frames. The first open reading frame, denoted 
PDiMPA2oRn/ is about 178 amino acids (presented in SEQ ID 
NO: 6) and encompasses about nucleotide numbers 2-535 of SEQ 
ID: 2. The second open reading frame, denoted PDiMPA2oRF2A is 
about 145 amino acids (presented in SEQ ID NO: 7) and 

20 encompasses about nucleotide numbers 453-887 of SEQ ID: 2. 
PDiMPA2oj^F2 includes the extended zinc binding domain, 
beginning at about amino acid 3 6 of SEQ ID NO: 7 as well as the 
tyrosine-containing motif, beginning at about amino acid 87. 
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The third open reading frame ^ denoted PDiMPA2oRF3A is about 134 
amino acids (presented in SEQ ID NO: 8) and encompasses 
nucleotide number 730-1131 of SEQ ID: 2, The fourth open 
reading frame, denoted PDiMPA2oRF4/ is about 154 amino acids 
5 (presented in SEQ ID NO: 9) and encompasses nucleotide number 
1112-1573 of SEQ ID: 2. The fifth open reading frame, denoted 
PDiMPA2oRF5/ is about 163 amino acids (presented in SEQ ID 
NO: 10) and encompasses nucleotide number 1429-1917 of SEQ 
ID: 2 . 

10 A comparison of the deduced nucleic acid sequences of 

nDiMPAli299 (SEQ ID N0:1) and nDiMPA22^25 (SEQ ID NO: 2) indicates 
that nDiMPA22i26 does not contain the stretch of nucleotides 
from about positions 1 through 55 of nDiMPA1^299 (^s numbered 
in SEQ ID N0:1) . As discussed above, it is believed that this 

15 stretch of nucleotides represents an unrelated cDNA clone that 
ligated to the 5' end of the astacin metalloendopeptidase 
nucleic acid molecule. The stretch of nucleotides spanning 
from about positions 56 through 907 of nDiMPAli299 (as numbered 
in SEQ ID N0:1) share 100% homology with the stretch of 

20 nucleotides spanning from about positions 1 through 852 of 
nDiMPA22i26 (as numbered in SEQ ID N0:2) . The stretch of 
nucleotides spanning from about positions 908 through 970 of 
nDiMPA1^299 (as numbered in SEQ ID N0:1) are missing from 
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nDiMPA22i26- The stretch of nucleotides spanning from about 
positions 971 through 1133 of nDiMPA1^299 (^s numbered in SEQ 
ID N0:1) share 100% homology with the stretch of nucleotides 
spanning from about positions 853 through 1015 of nDiMPA22i26 
5 (as numbered in SEQ ID N0:2). Nucleic acid molecule 
nDiMPA22i26 has a significantly longer 3' end than does 
nDiMPAl:L299 - 

A homology search of the non-redundant protein sequence 
database was performed through the National Center for 

10 Biotechnology Information using the BLAST network. This 
database includes +SwissProt + PIR + SPUpdate + GenPept + 
GPUpdate, The search, which was performed using SEQ ID NO: 7, 
SEQ ID NO: 8, SEQ ID NO: 9 and SEQ ID NO: 10 and SEQ ID NO: 11, 
showed that SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 and SEQ ID 

15 NO: 10 each shared significant homology at the amino acid level 
with known members of the astacin family of 
metalloendopeptidases . SEQ ID NO: 11, in contrast did not show 
significant homology to a known astacin metalloendopeptidase . 
A composite D, immitis ORE of the five open reading 

2 0 frames encoded by nDiMPA22i26 produced by lining up the 

five ORFs in relation to known astacin metalloendopeptidase 
sequences. The composite D. immitis ORE, which spans a region 
significantly larger than the 200-amino acid astacin protein. 
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is presented in SEQ ID NO: 11* A comparison between the 
astacin domain of SEQ ID NO: 11 and crayfish astacin showed 
about 29% homology at the amino acid level. The astacin 
domain of SEQ ID NO: 11 also shared about 30 percent, 31 
5 percent, 33 percent and 33 percent homology at the amino acid 
level with the astacin domains of, respectively, human bone 
morphogenetic protein 1, mouse kidney brush border 
metalloendopeptidase, human intestinal brush border 
metalloendopeptidases and Xenopus Jaevis embryonic protein 
10 UVS,2. 

Comparison of SEQ ID NO: 11 and the C. elegans R151.5 gene 
product, (Genbank accession niimber U00036) showed an about 24% 
homology between the two sequences. The C. elegans gene 
product also includes a well-conserved extended zinc binding 

15 domain motif and tyrosine-containing motif. It is interesting 
that although the C. elegans R151.5 gene product was 
identified by Wilson et al., ibid. ^ as an open reading frame, 
the authors of that publication describing a 2.2 megabase 
contiguous nucleotide sequence from the free-living nematode 

20 C. elegans failed to appreciate the homology between R151.5 
and the family of astacin metalloendopeptidases. The present 
inventors are apparently the first to note such a homology and 
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the likelihood that C. elegans encodes an astacin 
metalloendopeptidase • 

Example 3 

This Example discloses the production of a recombinant 
5 cell of the present invention and its use to produce a 
parasitic astacin metalloendopeptidase protein of the present 
invention. 

Recombinant molecule ptrcHis-nDiMPA2804^ containing 
nucleotides from about positions 119 through 922 of nDiMPA22i26 

10 (as numbered in SEQ ID NO: 2, the sequence of nDiMPA2804 
characterized by a coding strand having the nucleic acid 
sequence of SEQ ID NO: 21) operatively linked to trc 
transcription control sequences and to a fusion sequence 
encoding a poly-histidine segment comprising 6 histidines was 

15 produced in the following manner. An about 8 04-nucleotide DNA 
fragment containing nucleotides spanning from about 119 
through about 922 of nDiMPA22i26 (as numbered in SEQ ID N0:2), 
denoted nDiMPA2804 was cleaved from recombinant molecule 
p3gal-nDiMPA22i26A with BamEl restriction endonuclease, gel 

20 purified and subcloned into expression vector pTrcHisB 
(available from Invitrogen) that had been cleaved with BamEl. 
The resulting recombinant molecule ptrcHis-nDiMPA2804 was 

104 



2eiB-21-1-C1 

transformed into E. coli to form recombinant cell E. 
coli : ptrcHis-nDiMPA2804 • 

Recombinant cell E. coli :ptrcHis-nDiMPA2804 is cultured in 
shake flasks containing an enriched bacterial growth medium 
5 containing about 0.1 mg/ml ampicillin at about 37° C. When 
the cells reach an OB^qq of about 0,3, expression of E. 
coli :ptrcHis-nDiMPA2804 is induced by addition of about 1 mM 
isopropyl-p-D-thiogalactoside (IPTG), and the cells cultured 
for about 3 hours at about 37°C. Protein production is 

10 monitored by SDS-PAGE of recombinant cell lysates, followed by 
Coomassie blue staining, using standard techniques. 
Recombinant cell E. coli :ptrcHis-nDiMPA2804 produces a fusion 
protein, denoted herein as PHIS-PDiMPA2804/ that is not 
produced by cells transformed with the pTrcHisB plasmid 

15 lacking a parasite nucleic acid molecule insert. 



Example 4 

This Example discloses the production of another 
recombinant cell of the present invention capable of producing 
a parasitic astacin metalloendopeptidase protein of the 
2 0 present invention . 

Recombinant molecule pXPRHis-nDiMPA2804^ containing 
nucleotides from about positions 119 through 922 of nDiMPA22a26 
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(as numbered in SEQ ID NO: 2) operatively linked to AP^ 
transcription control sequences and to a fusion sequence 
encoding a poly-histidine segment comprising 6 histidines was 
produced in the following manner • Nucleic acid molecule 
5 nDiMPA2804/ produced as described in Example 3, was ligated 
into the BamEl restriction site of the XPr/T7/RSET-B 
expression vector. The vector, which is about 3455 base pairs 
(bp) , contains an about 1990-bp pair Pvull to Aatll fragment 
from pUC19 containing the ampicillin resistance gene and E. 

10 coii origin of replication; an about 1100 bp Bglll to Bglll 
DNA fragment from vector pRK248cIts (available as ATCC #33766) 
with a PmuII linker added to one end, containing the XP^ 
promoter, the cl^^"^ X repressor gene and 22 amino acids of the 
cro gene regulating lytic growth; an about 55-bp Bglll to Xbal 

15 segment from pGEMEX-1 (available from Promega, Madison, WI) 
which contains the T7 promoter; an about 170-bp Xbal to EcoRl 
segment from pRSET-B (available from Invitrogen) which 
contains the T7-S10 translational enhancer, the His6 fusion, 
the 11 amino acid SIO leader fusion, an enterokinase cleavage 

20 site and the multiple cloning site; and an about 140-bp 
fragment containing synthetic translational and transcription 
termination signals including the translation terminators 
in all three reading frames, RNA stabilization sequence from 
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Bacillus thurengiensis crystal protein and the 
rho- independent transcription terminator from the trpA operon. 
The resulting recombinant molecule, denoted pXPRHis-nDiMPA2804/ 
was transformed into coll to form recombinant cell E. 

coii :pXPRHis-nDiMPA2804 . 



Example 5 

This Example describes another recombinant cell of the 
present invention and its use to produce a parasitic astacin 
metalloendopeptidase protein of the present invention. 

Recombinant molecule pBBIII-nDiMPA22i26/ containing 
nucleic acid molecule nDiMPA22i26 (produced as described in 
Example 2) operatively linked to baculovirus polyhedron 
transcription control sequences was produced in the following 
manner , In order to produce a baculovirus recombinant 
molecule capable of directing the production of the protein 
encoded by nDiMPA22i26/ recombinant molecule p3gal-nDiMPA22i26/ 
produced as described in Example 2, was digested with Xhol, 
end-filled with Klenow DNA Polymerase, digested with PstI, gel 
purified and referred to as BvMPA2 . The baculovirus shuttle 
plasmid, BlueBacIII (BBIII) (available from Invitrogen) was 
digested with Ncol, end-filled, digested with PstI and treated 
with calf intestinal phosphatase. The resulting vector 
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fragment was gel purified and ligated to BvMPA2 . The 
resultant recombinant molecule, denoted pBBIII-nDiMPA22i26A was 
verified for proper insert orientation by restriction mapping. 
This construct and linear Baculogold baculovirus DNA 
5 (Pharmingen) were cotransf ected into Spodoptera frugiperda Sf9 
host cells (donated by Colorado Bioprocessing Center, Fort 
Collins, CO) • The resulting recombinant virus termed BvMPA, 
was cultivated for increased production of recombinant virus 
and to verify expression of nDiMPA22i26 by Western blot. 

10 Example 6 

This Example describes the cloning and sequencing of 
additional parasite astacin metalloendopeptidase nucleic acid 
molecules of the present invention. This Example also 
discloses the production of recombinant molecules of the 
15 present invention . 

Due to the unusual overlapping reading frames within both 
previously isolated astacin L3 cDNA clones, an astacin 
metalloendopeptidase nucleic acid molecule, denoted L3 
nDiMPA32292 was isolated from the L3 cDNA expression library 
20 described in Example 1, as follows. 

A D. immitis astacin metalloendopeptidase nucleic acid 
molecule of about 341 nucleotides, denoted nDiMPA234^ 
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(represented by nucleotides 504 through 844 of SEQ ID NO: 2) 
was PGR amplified from p3gal-nDiMPA22i26 using the following 
two primers: an oligonucleotide having the following 
sequence : 5 ' GTCGGATCCGCAGGAGGGAATTTCAATTTCAACA-3 ' (denoted 
5 Astacin l^" SEQ ID NO: 25) and an antisense oligonucleotide have 
the following sequence: 5 ' TCAAGATCTAATCCAGAAATGATGGCCCTTCACG 
3' (denoted Astacin 1" SEQ ID NO: 26) . The primers Astacin 1+ 
and Astacin 1" were designed based on the nucleotide sequence 
encoding regions surrounding the conserved zinc binding domain 

10 and hydrophilic region of the molecule nDiMPA22i26 described in 
Example 2 above^ the consensus nucleic acid sequence of which 
is denoted SEQ ID NO: 2. The portion of SEQ ID NO: 2 encoding 
a zinc binding domain and hydrophilic region spans from about 
nucleotide 558 through about nucleotide 614. Primer astacin 

15 1"^ (SEQ ID NO: 25) was designed from the nucleotide sequence of 
nDiMPA22i26 cind spans from about nucleotide 504 through about 
nucleotide 527 of SEQ ID NO: 2, Primer astacin 1" (SEQ ID 
NO: 2 6) is an antisense primer complementary to a region 
spanning from about nucleotide 819 through about nucleotide 

20 844 of SEQ ID N0:2. 

Nucleic acid molecule nDiMPA234i was radiolabeled and used 
as a probe to screen the L3 cDNA library. Plaques which 
hybridized under stringent hybridization conditions (See for 
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example, Sambrook et al • , supra, and/or Meinkoth et al . , 
supra) to the probe were isolated and rescreened by PGR 
analysis using phage vector primers that flank the vector 
multiple cloning site containing the D. immitis insert cDNAs . 
5 These primers included an oligonucleotide having the following 
sequence: 5 ' GGAAACAGCTATGACCATG3 ' (denoted Ml 3 rev, SEQ ID 
NO:27), and an antisense oligonucleotide having the following 
sequence: 5 ' GTAAAACGACGGCCAGT3 ' (denoted M13 univ, SEQ ID 
NO:28). The phage corresponding to the largest PGR product 

10 which hybridized under stringent hybridization conditions to 
the astacin probe was rescreened and plaque purified. A 
plaque-purified clone including D. immitis nucleic acid 
molecule L3 nDiMPA32292 was converted into a double-stranded 
recombinant molecule, herein denoted as p3gal-L3-nDiMPA32292/ 

15 using Exassist helper phage and SOLR E. coli according to the 
in vivo excision protocol described in the Stratagene ZAP-cDNA 
Synthesis Kit®, Double stranded plasmid DNA was prepared 
using an alkaline lysis protocol, such as that described in 
Sambrook . et a J . ^ ibid. Recombinant molecule p3gal-L3- 

2 0 nDiMPA32292 was transformed into E, coli to form recombinant 
cell E. coli :p3gal-L3-nDiMPA32292 - 

Recombinant molecule pBgal-L3-nDiMPA32292 was submitted to 
nucleic acid sequencing using the Sanger dideoxy chain 
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termination method, as described in Sambrook et aJ,^ ibid. An 
about 22 92-nucleotide consensus sequence of the coding strand 
of nucleic acid molecule nDiMPA32292 was determined and is 
presented herein as SEQ ID NO:29, SEQ ID NO:29 apparently 
5 encodes a single open reading frame. This open reading frame, 
denoted L3 nDiMPAS^ove (SEQ ID N0:30), encodes a protein, 
denoted PDiMPASggs/ that is about 692 amino acids long 
(presented as SEQ ID N0:31), SEQ ID NO:30 encompasses 
nucleotide nimbers from about 72 through about 2147 of SEQ ID 
10 NO:29. 

A comparison of the deduced nucleic acid sequences of 
nDiMPAli299 (SEQ ID N0:1) and L3 nDiMPA32292 (SEQ ID NO:29), 
indicates that L3 nDiMPA32292 does not contain the stretch of 
nucleotides from about positions 1 through 15 of nDlMPAl^sgg 

15 (as numbered in SEQ ID N0:1), As discussed above with regard 
to the comparison of SEQ ID N0:1 and SEQ ID NO: 2, it is 
believed that this stretch of nucleotides in SEQ ID N0:1 
represents an unrelated cDNA clone that ligated to the 5' end 
of the astacin metalloendopeptidase nucleic acid molecule. 

20 Additionally, the stretch of nucleotides spanning from about 
positions 908 through 970 of nDiMPAl:L299 (as numbered in SEQ ID 
N0:1) are missing from L3 nDiMPA32292 and nDiMPA22i26. 
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A comparison of the deduced nucleic acid sequences of 
nDiMPA22i26 (SEQ ID NO: 2) and L3 nDiMPA32292 (SEQ ID NO: 29)^ 
indicates that the stretch of nucleotides spanning from about 
positions 1 through 510 of nDiMPA22i26 (as numbered in SEQ ID 
5 NO : 2 ) share greater than 99% homology with the stretch of 
nucleotides spanning from about positions 14 6 through 655 of 
L3 nDiMPA32292 (as numbered in SEQ ID NO:29). The stretch of 
nucleotides spanning from about positions 511 through 2126 of 
nDiMPA22i26 (as numbered in SEQ ID NO: 2) share greater than 99% 

10 homology with the stretch of nucleotides spanning from about 
positions 664 through 2282 of L3 nDiMPA32292 (as numbered in 
SEQ ID NO:29). nDlMPAl^sgg (SEQ ID N0:1) and nDiMPA22i26 (SEQ ID 
NO: 2) do not contain the stretch of nucleotides from about 
positions 656 through 563 of L3 nDiMPA3 2292 (as numbered in SEQ 

15 ID NO:29). Additionally, nDiMPA22i26 does not contain 
nucleotides at positions 1261, 1264 and 1715 of L3 nDiMPA32292 
(as numbered in SEQ ID NO:29); and L3 nDiMPA32292 does not 
contain the nucleotide at position 852 of nDiMPA22i26 (as 
numbered in SEQ ID N0:2) , 

2 0 A homology search of the non-redundant protein sequence 

database was performed through the National Center for 
Biotechnology Information using the BLAST network. This 
database includes +SwissProt + PIR + SPUpdate + GenPept + 
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GPUpdate, The search, which was performed using SEQ ID NO: 31, 
showed that SEQ ID NO: 31 shared significant homology at the 
amino acid level with known members of the astacin family of 
metalloendopeptidases . A comparison between the astacin 
5 domain of SEQ ID NO: 31 (amino acid positions from about 122 
through 32 6) and crayfish astacin showed about 27,3% homology 
at the amino acid level. The astacin domain of SEQ ID NO: 31 
also shared about 31,7% and 34.1% homology at the amino acid 
level with the astacin domains of, respectively, quail astacin 
10 and the C. elegans R151.5 gene product, (Genbank accession 
number U00036) . SEQ ID N0:31 shows about 81.7% homology with 
the composite amino acid sequence derived from the five open 
reading frames encoded by nDiMPA22i26 (SEQ ID NO: 11) . 

Example 7 

15 This Example describes the cloning and sequencing of an 

adult parasite astacin metalloendopeptidase nucleic acid 

molecule of the present invention. 

Another astacin metalloendopeptidase nucleic acid 

molecule, denoted adult nDiMPA32032/ was isolated from an adult 
20 D. imiaitis cDNA expression library, the adult expression 

library being produced as described for the L3 cDNA library in 

Example 1, as follows. 
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An adult D. iimitis astacin metalloendopeptidase nucleic 
acid molecule was isolated by PGR from an adult male D. 
ixamitis cDNA expression library, using the sequence 
information obtained from SEQ ID NO:29. The primers used to 
5 amplify the adult nucleic acid molecule included an 
oligonucleotide having the following sequence: 
5 ' CATCTCGAGATCAGTGGAAAATTATCGAACG3 ' (SEQ ID NO: 35, also 
denoted as Asta+ and corresponding to nucleotides 119-141 of 
SEQ ID NO:29), and an antisense oligonucleotide having the 

10 following sequence: 5 ' ATTGAATTCACTTCTTTTTCGAGTCAGGCAA3 ' (SEQ 
ID NO: 36, also denoted as Astal and corresponding to 
nucleotides 2127-2150 of SEQ ID NO:29). A recombinant 
molecule containing a D. iimitis astacin metalloendopeptidase 
nucleic acid molecule, denoted adult nDiMPA32032/ was submitted 

15 to nucleic acid sequencing using the Sanger dideoxy chain 
termination method, as described in Sambrook et al., ihid. An 
about 2032-nucleotide consensus sequence of the coding strand 
of adult nDiMPA32032 determined and is presented as SEQ ID 

NO: 32. SEQ ID NO: 32 apparently encodes a single open reading 

20 frame, denoted adult nDiMPA32028 (SEQ ID NO:33). This open 
reading frame encodes a protein, denoted adult PDiMPA3676, that 
is about 67 6 amino acids long, the amino acid sequence of 
which is presented as SEQ ID NO: 34. SEQ ID NO: 33 encompasses 
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from about nucleotide numbers 2 through about 2029 of SEQ ID 
NO: 32. 

A comparison of the deduced nucleic acid sequences of L3 
nDiMPA32292 (SEQ ID NO: 29) and adult nDiMPA32032 (SEQ ID NO: 32) 
5 indicates that the nucleotides spanning from about positions 
119 through 2150 of L3 nDiMPA32292 (as numbered in SEQ ID 
NO : 29) share greater than 99% homology with the stretch of 
nucleotides spanning from about positions 1 through 2032 of 
adult nDiMPA32032 (as numbered in SEQ ID NO: 32). Apparent 

10 differences between the L3 and adult sequences occur at about 
nucleotide positions 593, 596, 607, 612, 661, 1456, and 1745 
of SEQ ID NO:29, and nucleotide positions 475, 478, 489, 494, 
543, 1338 and 1627 of SEQ ID NO: 32. These nucleotide 
differences result in apparent amino acid sequence differences 

15 at positions 179, 181, 197 and 462 of SEQ ID NO: 31, and 
positions 163, 165, 181 and 446 of SEQ ID NO:34. 

A homology search of the non-redundant protein sequence 
database was performed through the National Center for 
Biotechnology Information using the BLAST network. This 

20 database includes +SwissProt + PIR + SPUpdate + GenPept + 
GPUpdate. The search, which was performed using SEQ ID NO: 34, 
showed that SEQ ID NO: 34 shared significant homology at the 
amino acid level with known members of the astacin family of 
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metalloendopeptidases * A comparison between the astacin 
domain of SEQ ID NO: 34 (from about amino acid positions 122 
through 32 6) and crayfish astacin showed about 2 6.3% homology 
at the amino acid level. The astacin domain of SEQ ID NO: 34 
5 also shared about 31.2% and 34.6% homology at the amino acid 
level with the astacin domains of, respectively, quail astacin 
and the C. elegans R151,5 gene product, (Genbank accession 
number U00036) . SEQ ID NO:34 shows about 81.3% homology with 
the composite amino acid sequence derived from the five open 

10 reading frames encoded by nDiMPA22i26 (SEQ ID NO: 11). 

Comparison with the regions of homology in all known 
astacins (as discussed in detail above) , indicated that the 
amino acid sequences presented as SEQ ID NO: 31 (L3 PDiMPA3692A 
described above in Example 6) and SEQ ID NO: 34 (adult 

15 PDiMPA3g7g) contain three regions of homology which are 
conserved within about a 61 amino acid region of all known 
astacins. In L3 PDiMPA36g2 and adult PDiMPA3 576, these three 
regions span about a 60 amino acid sequence corresponding to 
amino acid positions 214 through 273 of L3 PDiMPA3g92 and 

20 positions 198 through 257 of adult PDiMPA367g (as numbered in 
SEQ ID N0:31 and SEQ ID NO:34, respectively). The first 
region of homology includes the zinc binding domain, which 
spans positions from about 214 through 224 of SEQ ID NO: 31 and 
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positions 198 through 208 of SEQ ID NO: 34. This first region 
includes three histidines which are present in all astacins 
for zinc binding (imidazole zinc ligands) at positions 214, 
218 and 224 of SEQ ID N0:31 and positions 198, 202 and 208 of 
5 SEQ ID NO: 34, and a glutairiate at position 215 of SEQ ID NO: 31 
and position 199 of SEQ ID NO: 34, which is assumed to be 
catalytically important in all astacins. In addition, this 
first region includes a glycine which is important for 
secondary structure of the protein at position 221 of SEQ ID 
y 10 NO: 31 and position 205 of SEQ ID NO: 34, and a glutamate which 
forms a salt bridge with the amino terminus of the mature 
astacin protein at position 225 of SEQ ID NO: 31 and position 
^ 209 of SEQ ID NO:34, 

The second region found in L3 PDiMPA3692 adult 
15 PDiMPA3g7g that is conserved in all known astacins spans amino 
^t' acid positions 228 through 232 of SEQ ID NO: 31 and positions 

^ 212 through 216 of SEQ ID NO: 34, This second region is a 

hydrophilic region common to all astacins. 

The third region found in L3 PDiMPAS^gs adult 
2 0 PDiMPA3g7g that is conserved in all known astacins spans amino 
acid positions 265 through 273 of SEQ ID NO: 31 and positions 
24 9 through 257 of SEQ ID NO: 34, and contains a portion of the 
zinc binding domain. In particular, the tyrosine at position 
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273 of SEQ ID NO: 31 and position 257 of SEQ ID NO: 34 is the 
fourth amino acid zinc ligand. In many astacins, this 
tyrosine is typically at position 61 from the first amino acid 
of the zinc binding domain (i-e», 61 amino acids from the 
5 first histidine in the first region) • In L3 PDiMPA3g92 and 
adult PDiMPA3676, this tyrosine is at position 60 from the 
first amino acid of the zinc binding domain (i.e., 60 amino 
acids from the first histidine in the first region at position 
214 of SEQ ID NO: 31 and position 198 of SEQ ID NO: 34) , 

10 Example 8 

This Example describes the cloning and sequencing of a 
filariid nematode cysteine protease nucleic acid molecule of 
the present invention. 

A D. immitis cysteine protease nucleic acid molecule of 

15 about 143 nucleotides^ denoted nDiCPi43, representing a partial 
D. immitis cysteine protease gene, was PGR amplified from D. 
immitis genomic DNA that had been extracted from adult female 
D. immitis worms using standard protocols similar to that 
described in Sambrook et al, ibid. The two primers used in 

20 the PGR amplification reaction included a 4-fold degenerate 
primer having SEQ ID NO: 18, namely 5' 

CGGGATCCTGTGGWTCATGYTGGGG 3' (denoted 25C/ BamHl site in bold; 
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W is A or T; Y is C or T) and an 8-fold degenerate antisense 
primer having SEQ ID No: 19, namely 5' TAICCICCRTTRCAICCYTC 3^ 
(denoted G65; R is A or G; Y is C or T; I is inosine) , Both 
primers were designed from published sequence of cysteine 
5 proteases. Primer 25C was further refined in that D. immitis 
codon bias was incorporated into 25C to reduce the degeneracy. 
The inventors found such codon bias was necessary to 
effectively isolate D. iimitis cysteine protease nucleic acid 
molecules of the present invention. 

10 The amplified PGR fragment, namely nDiCP-L43/ gel 

purified and cloned into the pCRII cloning vector (available 
from Invitrogen, San Diego, CA) , following manufacturer's 
instructions. An about 143 nucleotide sequence of nDiCPi43 was 
determined and is presented as SEQ ID NO: 12. SEQ ID NO: 12 

15 apparently encodes a protein of about 47 amino acids, which is 
presented as SEQ ID NO: 13. The translation initiation site of 
the protein and the translation termination codon are not 
contained within this genomic clone. 

A homology search of the non-redundant protein sequence 

20 database was performed through the National Center for 
Biotechnology Information using the BLAST network. This 
database includes +SwissProt + PIR + SPUpdate + GenPept + 
GPUpdate. The search was performed using SEQ ID NO: 13 and 
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showed significant homology to numerous cysteine proteinases. 
The highest scoring matches at the amino acid level include 
soybean probable thiol protease precursor (Genbank accession 
number P22895) , barley cysteine proteinase EP-B 1 precursor 
5 (Genbank accession number P25249) and barley cysteine 
proteinase EP-B 4 precursor (Genbank accession number P25250) . 
Parasite specific cysteine proteases having homology to SEQ ID 
NO: 13 include cysteine proteases from Trypanosoma brucei 
(Genbank accession numbers S07051 and S12099) , Leishmania 
^ 10 pifanoi (Genbank accession number B48566) , i. mexicana 
Q (Genbank accession number S25003) , T. congolense (Genbank 

III accession number (37048) ;r and Trichomonas vaginalis (Genbank 

accession number X77220) . SEQ ID NO: 13 shared about 16 
D percent, about 22 percent, about 24 percent, about 35 percent, 

□ 15 about 39 percent, about 44 percent and about 49 percent 
Jl homology at the amino acid level with cysteine proteases from, 

respectively, H. contortus (a nematode) , Schistosoma mansoni 
(a trematode) , C, elegans (a nematode) , Fasciola hepatica (a 
trematode) , Entamoeba histolytica (a protozoa) , Trypanosoma 
20 cruzi (a protozoa) and T. brucie. SEQ ID NO: 13 also shared 
about 50 percent amino acid homology with human cathepsin, 
SEQ ID NO: 13 also shared about 56 percent amino acid homology 
with a Paragonimus westermani cysteine protease reported in 
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European Patent Application Publication No. 0524834A2, by 
Hamajima et al.^ published January 27, 1993. The serine at 
about position 30 and the cysteine at about position 37 of SEQ 
ID NO: 13 were conserved in all of these cysteine proteases. 
5 Note that these homology calculations did not include the 
amino acids encoded by DNA primers SEQ ID NO: 18 and SEQ ID 
NO: 19. As such, the region used in the homology calculations 
spanned from about amino acid position 6 through 41 of SEQ ID 
N0:13. 

10 Example 9 

This Example discloses the production of a recombinant 
cell of the present invention and its use to produce a 
filariid nematode cysteine protease protein of the present 
invention. 

15 Recombinant molecule ptrcHis-nDiCPi42/ containing 

nucleotides from about positions 2 through 143 of nDiCPi43 (as 
numbered in SEQ ID NO: 12) operatively linked to trc 
transcription control sequences and to a fusion sequence 
encoding a poly-histidine segment comprising 6 histidines was 

2 0 produced in the following manner. An about 142-nucleotide DNA 
fragment containing nucleotides spanning from about 2 through 
about 143 of nDiCPi43 (as numbered in SEQ ID N0:12), denoted 
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nDiCPi42 (characterized by a coding strand having the nucleic 
acid sequence of SEQ ID NO: 23), was sequentially digested from 
nDiCP;^43 with BamEl restriction endonuclease followed by 
digestion with EcoRI . Nucleic acid molecule nDiCPi42 was gel 
5 purified and directionally subcloned into expression vector 
pTrcHisA (available from Invitrogen) that had been cleaved 
with BairMl and EcoRl and subsequently been gel purified. The 
resulting recombinant molecule, namely ptrcHis-nDiCP;L42/ was 
transformed into E. coii to form recombinant cell E, 

10 coii :ptrcHiS"nDiCPi42. 

Recombinant cell E. coii :ptrcHis-nDiCP242 is cultured in 
shake flasks containing an enriched bacterial growth medium 
containing about 0.1 mg/ml ampicillin at about 37° C. When 
the cells reach an ODgoo of about 0.3, expression of E. 

15 coii :ptrcHis-nDiCPi42 is induced by addition of about 1 mM 
isopropyl-p-D-thiogalactoside (IPTG), and the cells cultured 
for about 3 hours at about Sl^'C. Protein production is 
monitored by SDS-PAGE of recombinant cell lysates, followed by 
Coomassie blue staining, using standard techniques. 

20 Recombinant cell E, coii :ptrcHis-nDiCP-L42 produces a fusion 
protein, denoted herein as PHIS-PDiCPi42/ the deduced amino 
acid sequence of which is presented as SEQ ID NO: 24, that is 
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not produced by cells transformed with the pTrcHisA plasmid 
lacking a filariid nucleic acid molecule insert. 



Example 10 

This Example demonstrates that the protease inhibitors 
5 bestatin and phosphoramidon are able to inhibit immitis 
larval development, particularly molting. Bestatin (available 
from Enzyme Systems Products;. Livermore/. CA) primarily;, if not 
exclusively, inhibits amino peptidases and other 
exopeptidases . Phosphoramidon (also available from Enzyme 

1 0 Systems Products ) specifically inhibits thermolysin and 
collagenase as well as metalloendoproteases from Bacillus 
subtiliSr Streptomyces griseus and Pseudomonas aeruginosa 
microorganisms . 

D. immitis larvae were cultured in NI media as described, 

15 for example, in U.S. Patent Application Serial No. 08/153,554, 
ibid. NI medium contains a 1:1 mixture of NCTC-135 and 
Iscove ' s modified Dulbecco medium (available from Sigma 
Chemical Co., St. Louis, MO), 20% SeruMax, 2.5 micrograms (jig) 
of amphotericin B per ml, 0.1 nanograms (ng) of gentamicin per 

20 ml, 50 ]ag of sulfadiazine per ml and 10 pg of trimethoprim per 
ml. Larvae, at a concentration of about 200 L3 per milliliter 
(ml) of NI medium, were distributed in 0.5-ml aliquots into 
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the wells of a 2 4 -well plate • The volume in each well was 
adjusted to 1 ml using NI mediim and 10-millimolar (mM) stocks 
of bestatin or phosphoramidon, each of which was dissolved in 
NI media . The final concentration of inhibitor in the culture 
5 wells was either 0, 1, 2,5 or 5 mM, Larvae were incubated at 
about 37°C, 5% C02 and 95% relative humidity. Larvae were 
observed daily and the percent molt (% molt) was evaluated at 
72 hours. The percent molt was calculated for each well by 
dividing the number of cuticles by the number of larvae per 
10 well and multiplying by 100. There were three wells for each 
inhibitor concentration and six wells for the untreated 
control. The results of this study appear in Table 1. 

Table 1 



Effect of Protease Inhibitors on Larval Molting 



Group 


Concentrat 


% molt 


s.d. 
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Control 


NA 


82. 6 


15. 8 


NA 


NA 


NA 


Bestatin 


1.0 rtiM 


60. 0 


9,0 


27.3 








2.5 itiM 


34, 8 


4.3 


57,9 








5.0 mM 


4.9 


0.9 


94.1 


★ 




Phosphoramidon 


1.0 mM 


42. 6 


2.1 


48.4 


•k 


■k 




2.5 mM 


12.3 


1,2 


85.1 


-k 


•k 




5. 0 mM 


0.0 


0,0 


100. 0 


•k 


■k 



The results indicate that treatment of L3 by bestatin and 
2 0 phosphoramidon significantly reduces the ability of the larvae 
to molt. An analysis of variance was performed using % molt. 
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The difference overall was significant (P = 0.0001). Fisher 
PLSD (Fi) and Scheffe F-test (Sc) multiple comparisons 
comparing each group to the control were done after the ANOVA 
represent significant differences of p < 0,05, NA = not 
5 applicable) . 

It was also observed that, in general, bestatin-treated 
larvae moved much more slowly throughout the study compared to 
controls whereas phosphoramidon-treated larvae were very 
active compared to controls. Cuticle separation appeared to 

10 be occurring in the phosphoramidon-treated larvae, but the 
larvae could not open up the old cuticle and escape. The 
phosphoramidon-treated larvae were in poor shape by the end of 
the study. While not being bound by theory, it is believed 
that these phenomena suggest two distinct effects and that the 

15 inhibitors may be targeting different enzymes. 

Example 11 

Example 11 describes the isolation and characterization 
of protein-containing fractions from excretory/secretory (ES) 
products of D. immitis larvae. 
20 ES products from about 11,600 D. immitis L3/L4 were 

collected and concentrated as described, for example, in U.S. 
Patent Application Serial No. 08/153,554, ibid. The buffer was 
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exchanged to 20 mM piperazine-HCl, pH 6,0, 0,005% Brij 35 
using a Centriprep 10 (available from Amicon Inc , , Beverly, 
MA) . The resulting mixture was separated on a Mono-Q column 
(anion exchange) (available from Pharmacia Biotech Inc., 
5 Piscataway, NJ) equilibrated in 20 mM piperazine-HCl, pH 6,0 
using an increasing gradient of sodium chloride. Each 
fraction was then brought to 0,005% Brij 35, Buffer A was 20 
xnM piperazine-HCl, pH 6.0. Buffer B was 1 M sodium chloride 
in 2 0 mM piperazine-HCl, pH 6.0. The chromatography program 

10 was: (a) 0% B for 5 minutes; (b) 0% to about 50% B over 25 
minutes; (c) hold 3 minutes; (d) about 50% to 100% B over 5 
minutes. The flow rate was 0 , 5 ml per minute. Fractions were 
collected every minute. 

The collected fractions were assayed for metalloprotease 

15 activity using the fluorogenic compound H-phenylalanine-V- 
amido-4-methylcoumarin (H-Phe-AMC) . The assay was conducted 
as follows. A stock solution of 10 mM H-Phe-AMC in dimethyl 
sulfoxide (DMSO) was diluted 1:200 with 100 mM Tris-HCl, pH 
7.0. About 100 microliters (pi) of the diluted stock solution 

20 was placed in each of a desired number of wells in a 96-well 
microtiter plate . To each well was added 25 lal of the 
fractions, or control samples, to be tested. The resulting 
mixture was incubated at about 37 °C for at least 2 hours. The 
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microtiter plate was then placed on a UV light box and 
photographed to identify fractions in which AMC was cleaved 
(released AMC glows under such conditions) . 

Fractions 20 and 21 collected from the anion exchange 
5 both exhibited metalloproteolytic activity. An aliquot of 
fraction 21 was concentrated and evaluated by SDS-PAGE (14% 
Tris-glycine) . One major band was detected that migrated with 
an apparently molecule weight of about 60 kD. 

Anion exchange fraction 21 was then applied to size 

10 exclusion chromatography as described in U,S, Patent 
Application Serial No* 08/153;. 554, ibid. Specifically, 
Fraction 21 from the anion exchange column was applied to a 
TSK 3000 SW column (available from Beckman Instruments Inc., 
Fullerton, CA) in 50 mM Tris-HCl, pH 7,5, 150 mM sodium 

15 chloride, at a flow rate of 0.5 ml per minute. Fractions were 
collected every 0.5 minutes. When assayed using the 
microtiter plate fluorescent assay, fractions 21 and 22 were 
positive. The relative time of elution of these fractions was 
very close to the elution time of bovine serum albumin, which 

20 has a molecular weight of about 62 to 66 kD. 

Anion exchange fraction 21 was also submitted to 
isoelectric focussing under "native" conditions. The 
resultant gel was sliced into 1 mm fractions and the strips 
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assayed by the microtiter plate fluorescent assay. The active 
fraction was in a region having a pi of about 6.8. 

In a separate study, ES from about 2500 larvae was 
submitted to electrophoresis through each of two lanes of 
5 Novex Zymogel (available from Novex, San Diego, CA) . Zymogel 
contains about 0.1 percent gelatin. The two lanes were soaked 
in 2.5% Triton X-100 for about 30 minutes and subsequently 
washed in reaction buffer (50 mM Tris-HCl, pH 7.0, 5 mM 
calcium chloride, 0.02% Brij 35 and 200 mM sodium chloride) 

10 for about 30 minutes. One lane was then incubated in reaction 
buffer at about 37°C for 66 hours. The other lane was 
incubated in reaction buffer containing 2 mM EDTA for the same 
amount of time. Both lanes were then stained in 0.5% CBB- 
R250, 40% methanol, 10% acetic acid and destained in 40% 

15 methanol, 10% acetic acid in order to detect collagenase 
activity. Activity was identified by a clear zone in a blue 
background. ES proteins displaying metalloprotease activity 
that was completely inhibited by EDTA migrated with apparent 
molecular weights of about 60 kD, about 95 kD and at least 

20 about 200 kD. 
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SEQUENCE LISTING 
The following Sequence Listing is submitted pursuant to 
37 CFR §1.821. A copy in computer readable form is also 
submitted herewith. The paper and computer readable forms of 
this Sequence Listing are the same. 



(!) GENERAL INFORMATION: 

(i) APPLICANT: Tripp, Cynthia Ann 
Frank, Glenn R. 
Grieve, Robert B. 

(ii) TITLE OF INVENTION: NOVEL PARASITE ASTACIN 
METALLOENDOPEPTIDASE PROTEINS 

(iii) NUMBER OF SEQUENCES: 3 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SHERIDAN ROSS P.C. 

(B) STREET: 1700 LINCOLN ST., SUITE 3500 

(C) CITY: DENVER 

(D) STATE: CO 

(E) COUNTRY: USA 

(F) ZIP: 80203 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Connell, Gary J. 

(B) REGISTRATION NUMBER: 32,020 

(C) REFERENCE/DOCKET NUMBER: 2618-21-1-Cl 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (303) 8 63-97 00 

(B) TELEFAX: (303) 8 63-0223 
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(2) INFORMATION FOR SEQ ID NO ; 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TTTTTTTTTT TTTTTTTTGT TTCATTGTTC AGTCAGTGGA AAATTATCGA ACGCAGAAAG 60 

10 CATCACGAAA TACGTTAGAT CACATCAAAC AACTTATCAC CTTGAACGTA CAAAGAGAGA 12 0 

TTGGAAACAT AGATGATAAG ACATTAGCTG ATGAAATAGT ATTACAACGA CGGGATCCTG 180 

AGGCAAAATG GCATCATAAT GAACTATTCA TTAATGATCC AGATGCATAC TATCAAGGCG 2 40 

ATGTCGATTT GTCGGAAAAA CAAGCCGAAA TTCTAAGCGA ACATTTTAAA AATGAAATTG 3 00 

CTTTAACAGA GAAAGACGAC ACAATAATAC GGCGAAAAAA GAGCATTGGT CGTGAACCAT 3 60 

15 TTTACGTAAG ATGGAATCAT AAACGTCCCA TTAGCTATGA ATTTGCGGAA AGTATTCCAT 42 0 

TAGAAACACG TAGAAAAATT CGTTCAGCAA TAGCAATGTG GGAAGAACGA AGATGCATAC 48 0 

GATTCCAAGA AAATGGCCCA AATGTAGATC GAATTGAATT TTACGACGGT GGCGGTTGTT 540 

CAAGTTTTGT CGGCCGAACA GGAGGGAATT TCAATTTCAA CACCAGGATG TGATATTATT 600 

GGTATTATAT CACATGAAAT TGGTCATACT TTAGGAATAT TTCATGAGCA AGCACGTCGT 6 60 

2 0 GATC7\AAAAA ATCATATTTT TATTAATTAC AACAATATTC CATCAAGCCG TTGGAACAAT 72 0 

TTTTTTCCAT TATCAGAATA TGAAGCTGAT ATGTTTAATT TACCTTATGA TACAGGATCA 7 80 

GTAATGCACT ATGGTTCATA CGGATTTGCA AGAAATCCGT ATGAACCAAC TATTACAACA 8 40 

CGTGATAAAT TTCAACAGTA CACAATTGGG CAACGTGAAG GGCCATCATT TCTGGATTAT 900 

GCATCTGTTA AGCTTTATCT ACAAACGCAT TAATGATATT GTTATCAAAT GGATGATAAT 960 

25 TTCAATAAGT ATAAACAGCG CTTATCGTTG TACAGAACAA TGTGCTGATA TGCACTGCGA 102 0 

TCATAATGGT TATCCGGATC CTAATAATTG CGCGAAATGC TTGTGTCCAG ATGGTTTTGC 108 0 

TGGTCGTACC TGTCAATTTG TTCAATATAC ATCTTGCGGA GCTCTCATTA AGGTAAGTAT 1140 

TGTCTTTTGA CCTCTTCTCT GACTAAAATA TAAGTTAAGC ATATGTATCT TCCGTCTAAT 12 00 

GATTTTCTTG ATTTTGATTT GTTCAATGCT CTTCTTGATA ATAATATAAA AATTTTTGAA 1260 

3 0 AATAAAGTTA ACTTTTGGTC AAAAAAAAAA AAAAAAAAA 12 99 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2126 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

GAAAGCATCA CGAAATACGT TAGATCACAT CAAACAACTT ATCACCTTGA ACGTACAAAG 60 

10 AGAGATTGGA AACATAGATG ATAAGACATT AGCTGATGAA ATAGTATTAC AACGACGGGA 12 0 

TCCTGAGGCA AAATGGCATC ATAATGAACT ATTCATTAAT GATCCAGATG CATACTATCA 18 0 

AGGCGATGTC GATTTGTCGG AAAAACAAGC CGAAATTCTA AGCGAACATT TTAAAAATGA 24 0 

AATTGCTTTA ACAGAGAAAG ACGACACAAT AATACGGCGA AT^AT^AGAGCA TTGGTCGTGA 30 0 

ACCATTTTAC GTAAGATGGA ATCATAAACG TCCCATTAGC TATGAATTTG CGGAAAGTAT 3 60 

15 TCCATTAGAA ACACGTAGAA AAATTCGTTC AGCAATAGCA ATGTGGGAAG AACGAACATG 420 

CATACGATTC CAAGAAAATG GCCCAAATGT AGATCGAATT GAATTTTACG ACGGTGGCGG 48 0 

TTGTTCAAGT TTTGTCGGCC GAACAGGAGG GAATTTCAAT TTCAACACCA GGATGTGATA 540 

TTATTGGTAT TATATCACAT GAAATTGGTC ATACTTTAGG AATATTTCAT GAGCAAGCAC 600 

GTCGTGATCA AAAAAATCAT ATTTTTATTA ATTACAACAA TATTCCATCA AGCCGTTGGA 660 

2 0 ACAATTTTTT TCCATTATCA GAATATGAAG CTGATATGTT TAATTTACCT TATGATACAG 72 0 

GATCAGTAAT GCACTATGGT TCATACGGAT TTGCAAGAAA TCCGTATGAA CCAACTATTA 780 

CAACACGTGA TAAATTTCAA CAGTACACAA TTGGGCAACG TGAAGGGCCA TCATTTCTGG 840 

ATTATGCATC TGATAAACAG CGCTTATCGT TGTACAGAAC AATGTGCTGA TATGCACTGC 900 

GATCATAATG GTTATCCGGA TCCTAATAAT TGCGCGAAAT GCTTGTGTCC AGATGGTTTT 960 

25 GCTGGTCGTA CCTGTCAATT TGTTCAATAT ACATCTTGCG GAGCTCTCAT TAAGGCGAGG 102 0 

AAAATGCCTG TTACGATTTC GAGCCCAAAT TATCCAAACT TCTTCAATGT TGGTGATCAA 108 0 

TGTATTTGGT TGCTTACAGC TCCACGCGTG ATTCGTAAAT TTGCAGTTTG TTGAACAATT 114 0 

TCAATTACAA TGTGAAGATA CGTGTGATAA ATCCTATGTA GAAGTGAAAG CTGACGCTGA 12 00 

TTTTCGACCT ACTGGATATC GATTTTGTTG TTCGCGAGTG CCACGTCATA TTTTTCAATC 12 60 

30 TGCGACAAAC GAGATGGTAG TAATATTTCG CGGTTTTGGT GATGCGGGAA ATGGCTTTAA 132 0 
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AGCTAAAATT TGGTCAAACG TAGATGATGA TATAGCTAAT ACAATTGTAA CAACTGAAAT 138 0 

GGCAAAAATT TCGGAAAAAA TACCGAAGCT AACAGTTCCA ATAGTTAAAA CTATTACCAC 1440 

TCCTACAATA ACAACTACTA CTGCTTTCAT GATATCACCC AAGAAAGGCA ATGTCACCGC 15 00 

CACGAGAGTT GCTATCACTA CTACGCCGAC TACTACAATT ACTACGACTA TTGCCGGTAC 15 60 

GTACCAATCA CCGTAACTAA TAATACTACA CCTGTAGTAA GTGAAACTTT ACCATCATTG 162 0 

CCAGTCAAGA TTCGA7\ACAA AATAGGTGCA TGCGAATGTG GTGAATGGAC AGAATGGACA 168 0 

GGTCCATGCT CTCAAGAATG TGGCGGTTGC GGAAAACGTC TTCGAACACG TCAGTGTTCA 1740 

TCAGATACGG AATGTAGAAC AGAAGAAAAA CGTGCGTGTG CTTTTAAGTT TGCCCATACG 18 0 0 

GGACTAATTT CCTTATCAAT AATGGAGAGT TTCATATACT TTGGAAGGGC TGCTGTGTTG 18 60 

GTCTATTCCG ATCGGGAGAT ATGTGTTCAG CACTTGATGA TAACGAGAAT CCATTTCTGA 192 0 

AATTTCTAGA ATCACTGTTG AACATGCAAG ATTCTCGAAA AAACGATAAT TTGCCTGACT 198 0 

CGAAAAAGAA GTGATTGAAT GATTCGATAA TATTGATTAA TAAAACGGGT TGTATTCTCG 2 0 40 

TCATAGAGTA TCCGTTGATG TTTTTATCCA AAAAATTCTC TTGCTTTTAA TTATTGTGAA 2100 

TAAAACTTTT GTTTACCCAA AAAAAA 212 6 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Cys Phe lie Val Gin Ser Val Glu Asn Tyr Arg Thr Gin Lys Ala Ser 
15 10 15 

Arg Asn Thr Leu Asp His lie Lys Gin Leu lie Thr Leu Asn Val Gin 
20 25 30 

Arg Glu lie Gly Asn lie Asp Asp Lys Thr Leu Ala Asp Glu lie Val 
35 40 45 

Leu Gin Arg Arg Asp Pro Glu Ala Lys Trp His His Asn Glu Leu Phe 
50 55 60 

lie Asn Asp Pro Asp Ala Tyr Tyr Gin Gly Asp Val Asp Leu Ser Glu 
65 70 75 80 
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Lys Gin Ala Glu lie Leu Ser Glu His Phe Lys Asn Glu lie Ala Leu 
85 90 95 

Thr Glu Lys Asp Asp Thr He He Arg Arg Lys Lys Ser He Gly Arg 
100 105 110 

5 Glu Pro Phe Tyr Val Arg Trp Asn His Lys Arg Pro He Ser Tyr Glu 

115 120 125 

Phe Ala Glu Ser He Pro Leu Glu Thr Arg Arg Lys He Arg Ser Ala 
130 135 140 

He Ala Met Trp Glu Glu Arg Thr Cys He Arg Phe Gin Glu Asn Gly 
10 145 150 155 160 

Pro Asn Val Asp Arg He Glu Phe Tyr Asp Gly Gly Gly Cys Ser Ser 
165 170 175 

Phe Val Gly Arg Thr Gly Gly Asn Phe Asn Phe Asn Thr Arg Met 
180 185 190 

15 (2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

2 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

He Glu Leu Asn Phe Thr Thr Val Ala Val Val Gin Val Leu Ser Ala 
15 10 15 

25 Glu Gin Glu Gly He Ser He Ser Thr Pro Gly Cys Asp He He Gly 

20 25 30 

He He Ser His Glu He Gly His Thr Leu Gly He Phe His Glu Gin 
35 40 45 

Ala Arg Arg Asp Gin Lys Asn His He Phe He Asn Tyr Asn Asn He 
30 50 55 60 

Pro Ser Ser Arg Trp Asn Asn Phe Phe Pro Leu Ser Glu Tyr Glu Ala 
65 70 75 80 

Asp Met Phe Asn Leu Pro Tyr Asp Thr Gly Ser Val Met His Tyr Gly 
85 90 95 

35 Ser Tyr Gly Phe Ala Arg Asn Pro Tyr Glu Pro Thr He Thr Thr Arg 

100 105 110 

Asp Lys Phe Gin Gin Tyr Thr He Gly Gin Arg Glu Gly Pro Ser Phe 
115 120 125 
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Leu Asp Tyr Ala Ser Val Lys Leu Tyr Leu Gin Thr His 
130 135 140 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(1) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: protein 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Cys Thr Met Val His Thr Asp Leu Gin Glu lie Arg Met Asn Gin Leu 
15 10 15 

Leu Gin His Val lie Asn Phe Asn Ser Thr Gin Leu Gly Asn Val Lys 
20 25 30 

15 Gly His His Phe Trp lie Met His Leu Leu Ser Phe lie Tyr Lys Arg 

35 40 45 

lie Asn Asp lie Val lie Lys Trp Met lie lie Ser lie Ser lie Asn 
50 55 60 

Ser Ala Tyr Arg Cys Thr Glu Gin Cys Ala Asp Met His Cys Asp His 
20 65 70 75 80 

Asn Gly Tyr Pro Asp Pro Asn Asn Cys Ala Lys Cys Leu Cys Pro Asp 
85 90 95 

Gly Phe Ala Gly Arg Thr Cys Gin Phe Val Gin Tyr Thr Ser Cys Gly 
100 105 110 

25 Ala Leu lie Lys Val Ser lie Val Phe 

115 120 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

35 Lys Ala Ser Arg Asn Thr Leu Asp His lie Lys Gin Leu lie Thr Leu 

15 10 15 

Asn Val Gin Arg Glu lie Gly Asn lie Asp Asp Lys Thr Leu Ala Asp 
20 25 30 
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Glu lie Val Leu Gin Arg Arg Asp Pro Glu Ala Lys Trp His His Asn 
35 40 45 

Glu Leu Phe lie Asn Asp Pro Asp Ala Tyr Tyr Gin Gly Asp Val Asp 
50 55 60 

5 Leu Ser Glu Lys Gin Ala Glu lie Leu Ser Glu His Phe Lys Asn Glu 

65 70 75 80 

lie Ala Leu Thr Glu Lys Asp Asp Thr lie lie Arg Arg Lys Lys Ser 
85 90 95 

lie Gly Arg Glu Pro Phe Tyr Val Arg Trp Asn His Lys Arg Pro lie 
10 100 105 110 

Ser Tyr Glu Phe Ala Glu Ser lie Pro Leu Glu Thr Arg Arg Lys lie 
115 120 125 

Arg Ser Ala lie Ala Met Trp Glu Glu Arg Thr Cys lie Arg Phe Gin 
130 135 140 

15 Glu Asn Gly Pro Asn Val Asp Arg lie Glu Phe Tyr Asp Gly Gly Gly 

145 150 155 160 

Cys Ser Ser Phe Val Gly Arg Thr Gly Gly Asn Phe Asn Phe Asn Thr 
165 170 175 
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(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

lie Glu Leu Asn Phe Thr Thr Val Ala Val Val Gin Val Leu Ser Ala 
30 1 5 10 15 

Glu Gin Glu Gly lie Ser lie Ser Thr Pro Gly Cys Asp lie lie Gly 
20 25 30 

He He Ser His Glu He Gly His Thr Leu Gly He Phe His Glu Gin 
35 40 45 

35 Ala Arg Arg Asp Gin Lys Asn His He Phe He Asn Tyr Asn Asn He 

50 55 60 

Pro Ser Ser Arg Trp Asn Asn Phe Phe Pro Leu Ser Glu Tyr Glu Ala 
65 70 75 80 
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Asp Met Phe Asn Leu Pro Tyr Asp Thr Gly Ser Val Met His Tyr Gly 
85 90 95 

Ser Tyr Gly Phe Ala Arg Asn Pro Tyr Glu Pro Thr lie Thr Thr Arg 
100 105 110 

5 Asp Lys Phe Gin Gin Tyr Thr lie Gly Gin Arg Glu Gly Pro Ser Phe 

115 120 125 

Leu Asp Tyr Ala Ser Asp Lys Gin Arg Leu Ser Leu Tyr Arg Thr Met 
130 135 140 

Cys 

10 145 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Cys Thr Met Val His Thr Asp Leu Gin Glu lie Arg Met Asn Gin Leu 
20 1 5 10 15 

Leu Gin His Val lie Asn Phe Asn Ser Thr Gin Leu Gly Asn Val Lys 
20 25 30 

Gly His His Phe Trp lie Met His Leu lie Asn Ser Ala Tyr Arg Cys 
35 40 45 

25 Thr Glu Gin Cys Ala Asp Met His Cys Asp His Asn Gly Tyr Pro Asp 

50 55 60 

Pro Asn Asn Cys Ala Lys Cys Leu Cys Pro Asp Gly Phe Ala Gly Arg 
65 70 75 80 

Thr Cys Gin Phe Val Gin Tyr Thr Ser Cys Gly Ala Leu lie Lys Ala 
30 85 90 95 

Arg Lys Met Pro Val Thr lie Ser Ser Pro Asn Tyr Pro Asn Phe Phe 
100 105 110 

Asn Val Gly Asp Gin Cys lie Trp Leu Leu Thr Ala Pro Arg Val lie 
115 120 125 

35 Arg Lys Phe Ala Val Cys 

130 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 
5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Phe Val Asn Leu Gin Phe Val Glu Gin Phe Gin Leu Gin Cys Glu Asp 
10 1 5 10 15 

Thr Cys Asp Lys Ser Tyr Val Glu Val Lys Ala Asp Ala Asp Phe Arg 
20 25 30 

Pro Thr Gly Tyr Arg Phe Cys Cys Ser Arg Val Pro Arg His lie Phe 
35 40 45 

15 Gin Ser Ala Thr Asn Glu Met Val Val lie Phe Arg Gly Phe Gly Asp 

50 55 60 

Ala Gly Asn Gly Phe Lys Ala Lys lie Trp Ser Asn Val Asp Asp Asp 
65 70 75 80 

lie Ala Asn Thr lie Val Thr Thr Glu Met Ala Lys lie Ser Glu Lys 
20 85 90 95 

lie Pro Lys Leu Thr Val Pro lie Val Lys Thr lie Thr Thr Pro Thr 
100 105 110 

lie Thr Thr Thr Thr Ala Phe Met lie Ser Pro Lys Lys Gly Asn Val 
115 120 125 

25 Thr Ala Thr Arg Asx Ala lie Thr Thr Thr Pro Thr Thr Thr lie Thr 

130 135 140 

Thr Thr lie Ala Gly Thr Tyr Gin Ser Pro 
145 150 

(2) INFORMATION FOR SEQ ID NO: 10: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asn Tyr Tyr His Ser Tyr Asn Asn Asn Tyr Tyr Cys Phe His Asp lie 
15 10 15 
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Thr Gin Glu Arg Gin Cys His Arg His Glu Ser Cys Tyr His Tyr Tyr 
20 25 30 

Ala Asp Tyr Tyr Asn Tyr Tyr Asp Tyr Cys Arg Tyr Val Pro lie Thr 
35 40 45 

5 Val Thr Asn Asn Thr Thr Pro Val Val Ser Glu Thr Leu Pro Ser Leu 

50 55 60 

Pro Val Lys lie Arg Asn Lys lie Gly Ala Cys Glu Cys Gly Glu Trp 
65 70 75 80 

Thr Glu Trp Thr Gly Pro Cys Ser Gin Glu Cys Gly Gly Cys Gly Lys 
10 85 90 95 

Arg Leu Arg Thr Arg Gin Cys Ser Ser Asp Thr Glu Cys Arg Thr Glu 
100 105 110 

Glu Lys Arg Ala Cys Ala Phe Lys Phe Ala His Thr Gly Leu lie Ser 
115 120 125 

15 Leu Ser lie Met Glu Ser Phe lie Tyr Phe Gly Arg Ala Ala Val Leu 

130 135 140 

Val Tyr Ser Asp Arg Glu lie Cys Val Gin His Leu Met lie Thr Arg 
145 150 155 160 
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(2) INFORMATION FOR SEQ ID NO: 11: 



{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 8 amino acids 

(B) TYPE: amino acid 
2 5 (C) STRAMDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Lys Ala Ser Arg Asn Thr Leu Asp His lie Lys Gin Leu lie Thr Leu 
30 1 5 10 15 

Asn Val Gin Arg Glu lie Gly Asn lie Asp Asp Lys Thr Leu Ala Asp 
20 25 30 

Glu lie Val Leu Gin Arg Arg Asp Pro Glu Ala Lys Trp His His Asn 
35 40 45 

35 Glu Leu Phe lie Asn Asp Pro Asp Ala Tyr Tyr Gin Gly Asp Val Asp 

50 55 60 



138 



2618'21-1-C1 



Leu Ser Glu Lys Gin Ala Glu lie Leu Ser Glu His Phe Lys Leu Asn 
65 70 75 80 

Glu lie Ala Leu Thr Glu Lys Asp Asp Thr lie lie Arg Arg Lys Lys 
85 90 95 

5 Ser lie Gly Arg Glu Pro Phe Tyr Val Arg Trp Asn His Lys Arg Pro 

100 105 110 

lie Ser Tyr Glu Phe Ala Glu Ser lie Pro Leu Glu Thr Arg Arg Lys 
115 120 125 

lie Arg Ser Ala lie Ala Met Trp Glu Glu Arg Thr Cys lie Arg Phe 
10 130 135 140 

Gin Glu Asn Gly Pro Asn Val Asp Arg lie Glu Phe Tyr Asp Gly Gly 
145 150 155 160 

Gly Cys Ser Ser Phe Val Gly Arg Gin Glu Gly lie Ser lie Ser Thr 
165 170 175 

15 Pro Gly Cys Asp lie lie Gly lie lie Ser His Glu lie Gly His Thr 

180 185 190 

Leu Gly lie Phe His Glu Gin Ala Arg Arg Asp Gin Lys Asn His lie 
195 200 205 

Phe lie Asn Tyr Asn Asn lie Pro Ser Ser Arg Trp Asn Asn Phe Phe 
20 210 215 220 

Pro Leu Ser Glu Tyr Glu Ala Asp Met Phe Asn Leu Pro Tyr Asp Thr 
225 230 235 240 

Gly Ser Val Met His Tyr Gly Ser Tyr Gly Phe Ala Arg Asn Pro Tyr 
245 250 255 

25 Glu Pro Thr lie Thr Thr Arg Asp Lys Phe Gin Gin Tyr Thr lie Gly 

260 265 270 

Gin Arg Glu Gly Pro Ser Phe Leu Asp Met His Leu lie Asn Ser Ala 
275 280 285 

Tyr Arg Cys Thr Glu Gin Cys Ala Asp Met His Cys Asp His Asn Gly 
30 290 295 300 

Tyr Pro Asp Pro Asn Asn Cys Ala Lys Cys Leu Cys Pro Asp Gly Phe 
305 310 315 320 

Ala Gly Arg Thr Cys Gin Phe Val Gin Tyr Thr Ser Cys Gly Ala Leu 
325 330 335 

3 5 lie Lys Ala Arg Lys Met Pro Val Thr lie Ser Ser Pro Asn Tyr Pro 

340 345 350 

Asn Phe Phe Asn Tyr Gly Asp Gin Cys lie Trp Leu Leu Thr Ala Pro 
355 360 365 
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Arg Val Phe Val Asn Leu Gin Phe Val Glu Gin Phe Gin Leu Gin Cys 
370 375 380 

Glu Asp Thr Cys Asp Lys Ser Tyr Val Glu Val Lys Ala Asp Ala Asp 
385 390 395 400 

5 Phe Arg Pro Thr Gly Tyr Arg Phe Cys Cys Ser Arg Val Pro Arg His 

405 410 415 

lie Phe Gin Ser Ala Thr Asn Glu Met Val Val lie Phe Arg Gly Phe 
420 425 430 

Gly Asp Ala Gly Asn Gly Phe Lys Ala Lys lie Trp Ser Asn Val Asp 
10 435 440 445 

Asp Asp lie Ala Asn Thr lie Val Thr Thr Glu Met Ala Lys lie Ser 
450 455 460 

Glu Lys lie Pro Lys Leu Thr Val Pro lie Val Lys Thr lie Thr Thr 
465 470 475 480 

15 Pro Thr lie Thr Thr Thr Thr Ala Phe Met lie Ser Pro Lys Lys Gly 

485 490 495 

Asn Val Thr Ala Thr Arg Val Ala lie Thr Thr Thr Pro Thr Thr Thr 
500 505 510 

lie Thr Thr Thr lie Ala Gly Thr Tyr Gin Ser Val Thr Asn Asn Thr 
20 515 520 525 

Thr Pro Val Val Ser Glu Thr Leu Pro Ser Leu Pro Val Lys lie Arg 
530 535 540 

Asn Lys lie Gly Ala Cys Glu Cys Gly Glu Trp Thr Glu Trp Thr Gly 
545 550 555 560 

25 Pro Cys Ser Gin Glu Cys Gly Gly Cys Gly Lys Arg Leu Arg Thr Arg 

565 570 575 

Gin Cys Ser Ser Asp Thr Glu Cys Arg Thr Glu Glu Lys Arg Ala Cys 
580 585 590 

Ala Phe Lys Phe Ala His Thr Gly Leu lie Ser Leu Ser lie Met Glu 
30 595 600 605 

Ser Phe lie Tyr Phe Gly Arg Ala Ala Val Leu Val Tyr Ser Asp Arg 
610 615 620 

Glu lie Cys Val Gin His Leu Met lie Thr Arg lie His Phe 
625 630 635 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
10 (B) LOCATION: 1..143 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGA TCC TGT GGT TCA TGT TGG GOT TTT TCT GTT ACT GGC AAT ATT GCA 48 
Gly Ser Cys Gly Ser Cys Trp Ala Phe Ser Val Thr Gly Asn lie Ala 
15 10 15 

15 AGT CTC TGG GCT ATT AAA ACA GGT GAT TTG ATA TCG CTT TCC GAG CAA 96 
Ser Leu Trp Ala lie Lys Thr Gly Asp Leu lie Ser Leu Ser Glu Gin 
20 25 30 

GJ\A TTG ATT GAT TGT GAT GTG GTT GAT GAG GGC TGC AAC GGC GGC TA 143 
Glu Leu lie Asp Cys Asp Val Val Asp Glu Gly Cys Asn Gly Gly 
20 35 40 45 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Ser Cys Gly Ser Cys Trp Ala Phe Ser Val Thr Gly Asn lie Ala 
15 10 15 

30 Ser Leu Trp Ala lie Lys Thr Gly Asp Leu lie Ser Leu Ser Glu Gin 

20 25 30 

Glu Leu lie Asp Cys Asp Val Val Asp Glu Gly Cys Asn Gly Gly 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 14: 

35 (i) SEQUENCE CH7\RACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



141 



(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= INOSINE 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label= INOSINE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ACWCATGAAA TNGSNCAT 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AATACGACTC ACTATAG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TGGTATTATA TCACATGAAA TTGGTCATAC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCCAATTGTG TACTGTTGAA ATTTATCAC 2 9 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CGGGATCCTG TGG\ArTCATGY TGGGC 25 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

2 0 TANCCNCCRT TRCANCCYTC 2 0 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TCACATGAAA TTGGTCATAC TTTAGGAATA TTTCATGAGC AAGCACGTCG TGATCAAAAA 60 

3 0 AATCATATTT TTATTAATTA CAACAATATT CCATCAAGCC GTTGGAACAA TTTTTTTCCA 12 0 

TTATCAGAAT ATGAAGCTGA TATGTTTAAT TTACCTTATG ATACAGGATC AGTAATGCAC 18 0 

TATGGTTCAT ACGGATTTGC AAGAAATCCG TATGAACCAA CTATTACAAC ACGTGATAAA 240 

TTTCAACAGT ACACAATTGG GCAACGTGAA GGGCCATCAT TTCTGGATTA TGCATCTGTT 30 0 

AAGCTTTATC TACAAACGCA TTAATGATAT TGTTATCAAA TGGATGATAA TTTCAATAAG 3 60 
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TATAAACAGC GCTTATCGTT GTACAGAACA ATGTGCTGAT ATGCACTGCG ATCATAATGG 42 0 

TTATCCGGAT CCTAATAATT GCGCGAAATG CTTGTGTCCA GATGGTTTTG CTGGTCGTAC 480 

CTGTCAATTT GTTCAATATA CATCTTGCGG AGCTCTCATT AAGGTAAGTA TTGTCTTTTG 540 

ACCTCTTCTC TGACTAAAAT ATAAGTTAAG CATATGTATC TTCCGTCTAA TGATTTTCTT 600 

5 GATTTTGATT TGTTCAATGC TCTTCTTGAT AATAATATAA AAATTTTTGA AAATAAAGTT 660 

AACTTTTGGT CAAAAAAAAA AAA7\AAAAA 68 9 

(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

15 GATCCTGAGG CAAAATGGCA TCATAATGAA CTATTCATTA ATGATCCAGA TGCATACTAT 60 

CAAGGCGATG TCGATTTGTC GGAAAAACAA GCCGTWITTC TAAGCGAACA TTTTAAAAAT 12 0 

GAAATTGCTT TAACAGAGAA AGACGACACA ATAATACGGC GAAAAAAGAG CATTGGTCGT 18 0 

GAACCATTTT ACGTAAGATG GAATCATAAA CGTCCCATTA GCTATGAATT TGCGGAAAGT 24 0 

ATTCCATTAG AAACACGTAG A7\A7\ATTCGT TCAGCAATAG CAATGTGGGA AGAACGAACA 30 0 

2 0 TGCATACGAT TCCAAGAAAA TGGCCCAAAT GTAGATCGAA TTGAATTTTA CGACGGTGGC 360 

GGTTGTTCAA GTTTTGTCGG CCGAACAGGA GGGAATTTCA ATTTCAACAC CAGGATGTGA 42 0 

TATTATTGGT ATTATATCAC ATGAAATTGG TCATACTTTA GGAATATTTC ATGAGC^JVGC 48 0 

ACGTCGTGAT CAAAAA7\ATC ATATTTTTAT TAATTACAAC AATATTCCAT C7\AGCCGTTG 540 

GAACAATTTT TTTCCATTAT CAGAATATGA AGCTGATATG TTTAATTTAC CTTATGATAC 600 

25 AGGATCAGTA ATGCACTATG GTTCATACGG ATTTGCAAGA AATCCGTATG AACCAACTAT 660 

TACAACACGT GATAAATTTC AACAGTACAC AATTGGGCAA CGTGAAGGGC CATCATTTCT 720 

GGATTATGCA TCTGATAAAC AGCGCTTATC GTTGTACAGA ACAATGTGCT GATATGCACT 780 

GCGATCATAA TGGTTATCCG GATC 8 04 
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(2) INFORMZYTION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TGGTATTATA TCACATGAAA TTGGTCATAC TTTAGGAATA TTTCATGAGC AAGCACGTCG 60 

10 TGATCAAAAA AATCATATTT TTATTAATTA CAACAATATT CCATCAAGCC GTTGGAACAA 12 0 

TTTTTTTCCA TTATCAGAAT ATGAAGCTGA TATGTTTAAT TTACCTTATG ATACAGGATC 18 0 

AGTAATGCAC TATGGTTCAT ACGGATTTGC AAGAAATCCG TATGAACCAA CTATTACAAC 240 

ACGTGATAAA TTTCAACAGT ACACAATTGG G 271 
(2) INFORMATION FOR SEQ ID NO:23: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

2 0 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3. . 140 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

25 GA TCC TGT GGT TCA TGT TGG GCT TTT TCT GTT ACT GGC AAT ATT GCA 47 
Ser Cys Gly Ser Cys Trp Ala Phe Ser Val Thr Gly Asn lie Ala 
15 10 15 

AGT CTC TGG GCT ATT AAA ACA GGT GAT TTG ATA TCG CTT TCC GAG CAA 95 
Ser Leu Trp Ala lie Lys Thr Gly Asp Leu lie Ser Leu Ser GIu Gin 
30 20 25 30 

GAA TTG ATT GAT TGT GAT GTG GTT GAT GAG GGC TGC AAC GGC GGC 140 
Glu Leu lie Asp Cys Asp Val Val Asp Glu Gly Cys Asn Gly Gly 
35 40 45 

TA 142 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ser Cys Gly Ser Cys Trp Ala Phe Ser Val Thr Gly Asn lie Ala Ser 
15 10 15 

10 Leu Trp Ala lie Lys Thr Gly Asp Leu lie Ser Leu Ser Glu Gin Glu 

20 25 30 

Leu lie Asp Cys Asp Val Val Asp Glu Gly Cys Asn Gly Gly 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 25: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

2 0 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GTCGGATCCG CAGGAGGGAA TTTCAATTTC AACA 34 
(2) INFORMATION FOR SEQ ID NO: 26: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

30 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TCAAGATCTA ATCCAGAAAT GATGGCCCTT CACG 34 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

10 GGAAACAGCT ATGACCATG 19 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 

2 0 GTAAAACGAC GGCCAGT 17 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2292 base pairs 

(B) TYPE: nucleic acid 

2 5 ( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
30 (B) LOCATION: 1459 

(D) OTHER INFORMATION: /note= "1459 :S=C or G; 
aa4 63 :Xaa=Alanine or Glycine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

TAGATTTCGA TTCGTCTTTG TTAATTCATC TTCGTCAGAT TTATTAGAGA AAAATAAAAA 60 

35 TTTTGATCGC AATGAAGCAG GTTATCATCT TTCCTCAGCT TTTCATTTGT TTCATTGTTC 12 0 

AGTCAGTGGA AAATTATCGA ACGCAGAAAG CATCACGAAA TACGTTAGAT CACATCAAAC 18 0 

AACTTATCAC CTTGAACGTA CAAAGAGAGA TTGGAAACAT AGATGATAAG ACATTAGCTG 240 
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ATGAAATAGT ATTACAACGA CGGGATCCTG AGGC7\AAATG GCATCATAAT GAACTATTCA 3 00 

TTAATGATCC AGATGCATAC TATCAAGGCG ATGTCGATTT GTCGGAAAAA CAAGCCGA?lA 360 

TTCTAAGCGA ACATTTTAAA AATGAAATTG CTTTAACAGA GAAAGACGAC ACAATAATAC 42 0 

GGCGAAAAAA GAGCATTGGT CGTGAACCAT TTTACGTAAG ATGGAATCAT AAACGTCCCA 48 0 

5 TTAGCTATGA ATTTGCGG7\A AGTATTCCAT TAGA7\ACACG TAGAAAAATT CGTTCAGCAA 540 

TAGCAATGTG GGAAGAACGA AGATGCATAC GATTCCAAGA AAATGGCCCA AATGTTGATC 600 

GAATTGAATT TTACGACGGT GGCGGTTGTT CAAGTTTTGT CGGCCGAACA GGAGGCACGC 660 

AAGGAATTTC AATTTCAACA CCAGGATGTG ATATTATTGG TATTATATCA CATGAAATTG 72 0 

GTCATACTTT AGGAATATTT CATGAGCAAG CACGTCGTGA TCAAAAAAAT CATATTTTTA 7 80 

10 TTAATTACAA CAATATTCGA TCAAGCCGTT GGAACAATTT TTTTCCATTA TCAGAATATG 8 40 

AAGCTGATAT GTTTAATTTA CCTTATGATA CAGGATCAGT AATGCACTAT GGTTCATACG 9 00 

GATTTGCAAG AAATCCGTAT GAACCAACTA TTACAACACG TGATAAATTT CAACAGTACA 9 60 

CAATTGGGCA ACGTGAAGGG CCATCATTTC TGGATTATGC ATCTATAAAC AGCGCTTATC 102 0 

GTTGTACAGA ACAATGTGCT GATATGCACT GCGATCATAA TGGTTATCCG GATCCTAATA 10 8 0 

15 ATTGCGCGAA ATGCTTGTGT CCAGATGGTT TTGCTGGTCG TACCTGTCAA TTTGTTCAAT 114 0 

ATACATCTTG CGGAGCTCTC ATTAAGGCGA GGAAAATGCC TGTTACGATT TCGAGCCCAA 12 00 

ATTATCCA?A CTTCTTCAAT GTTGGTGATC AATGTATTTG GTTGCTTACA GCTCCACGCG 12 60 

GTGGATTCGT AAATTTGCAG TTTGTTGAAC AATTTCAATT ACAATGTGAA GATACGTGTG 132 0 

ATAAATCCTA TGTAGAAGTG AAAGCTGACG CTGATTTTCG ACCTACTGGA TATCGATTTT 138 0 

2 0 GTTGTTCGCG AGTGCCACGT CATATTTTTC AATCTGCGAC AAACGAGATG GTAGTAATAT 1440 

TTCGCGGTTT TGGTGATGCG GGAAATGGCT TTAAAGCTAA AATTTGGTCA AACGTAGATG 1500 

ATGATATAGC TAATACAATT GTAACAACTG AAATGGCAAA AATTTCGGAA AAAATACCGA 15 60 

AGCTAACAGT TCCAATAGTT AAAACTATTA CCACTCCTAC AATAACAACT ACTACTGCTT 162 0 

TCATGATATC ACCCAAGAAA GGCAATGTCA CCGCCACGAG AGTTGCTATC ACTACTACGC 168 0 

25 CGACTACTAC AATTACTACG ACTATTGCCG GTACGGTACC AATCACCGTA ACTAATAATA 1740 

CTACACCTGT AGTAAGTGAA ACTTTACCAT CATTGCCAGT CAAGATTCGA AACAAAATAG 18 00 

GTGCATGCGA ATGTGGTGAA TGGACAGAAT GGACAGGTCC ATGCTCTCAA GAATGTGGCG 18 60 

GTTGCGGAAA ACGTCTTCGA ACACGTCAGT GTTCATCAGA TACGGAATGT AGAACAGAAG 192 0 
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/WWVCGTGC GTGTGCTTTT AAAGTTTGCC CATACGGGAC TAATTTCCTT ATCAATAATG 



GAGAGTTTCA TATACTTTGG AAGGGCTGCT GTGTTGGTCT ATTCCGATCG GGAGATATGT 
GTTCAGCACT TGATGATAAC GAGAATCCAT TTCTGAAATT TCTAGAATCA CTGTTGAACA 
TGCAAGATTC TCGAAAAAAC GATAATTTGC CTGACTCGAA AAAGAAGTGA TTGAATGATT 
CGATAATATT GATTAATAAA ACGGGTTGTA TTCTCGTCAT AGAGTATCCG TTGATGTTTT 
TATCCAAAAA ATTCTCTTGC TTTTAATTAT TGTGAATAAA ACTTTTGTTT ACCCAAAAAA 
AAAAAAA7\AA AA 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ATG AAG CAG GTT ATC ATC TTT CCT CAG CTT TTC ATT TGT TTC ATT GTT 
Met Lys Gin Val lie lie Phe Pro Gin Leu Phe lie Cys Phe lie Val 
15 10 15 

CAG TCA GTG GAA AAT TAT CGA ACG CAG AAA GCA TCA CGA AAT ACG TTA 
Gin Ser Val Glu Asn Tyr Arg Thr Gin Lys Ala Ser Arg Asn Thr Leu 
20 25 30 

GAT CAC ATC AAA CAA CTT ATC ACC TTG AAC GTA CAA AGA GAG ATT GGA 
Asp His lie Lys Gin Leu lie Thr Leu Asn Val Gin Arg Glu lie Gly 
35 40 45 

AAC ATA GAT GAT AAG ACA TTA GCT GAT GAA ATA GTA TTA CAA CGA CGG 
Asn lie Asp Asp Lys Thr Leu Ala Asp Glu lie Val Leu Gin Arg Arg 
50 55 60 

GAT CCT GAG GCA AAA TGG CAT CAT AAT GAA CTA TTC ATT AAT GAT CCA 
Asp Pro Glu Ala Lys Trp His His Asn Glu Leu Phe lie Asn Asp Pro 
65 70 75 80 



GAT GCA TAG TAT CAA GGC GAT GTC GAT TTG TCG GAA AAA CAA GCC GAA 

Asp Ala Tyr Tyr Gin Gly Asp Val Asp Leu Ser Glu Lys Gin Ala Glu 
85 90 95 
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ATT CTA AGC GAA CAT TTT AAA AAT GAA ATT GCT TTA ACA GAG AAA GAG 336 
lie Leu Ser Glu His Phe Lys Asn Glu lie Ala Leu Thr Glu Lys Asp 
100 105 110 

GAG ACA ATA ATA CGG CGA AAA AAG AGC ATT GGT CGT GAA CCA TTT TAG 38 4 

Asp Thr lie lie Arg Arg Lys Lys Ser lie Gly Arg Glu Pro Phe Tyr 
115 120 125 

GTA AGA TGG AAT CAT AAA CGT CCC ATT AGC TAT GAA TTT GCG GAA AGT 432 
Val Arg Trp Asn His Lys Arg Pro lie Ser Tyr Glu Phe Ala Glu Ser 
130 135 140 



10 ATT CCA TTA GAA ACA CGT AGA AAA ATT CGT TCA GCA ATA GCA ATG TGG 
lie Pro Leu Glu Thr Arg Arg Lys lie Arg Ser Ala lie Ala Met Trp 
145 150 155 160 



4 0 TCA TTT CTG GAT TAT GCA TCT ATA AAC AGC GCT TAT CGT TGT ACA GAA 
Ser Phe Leu Asp Tyr Ala Ser lie Asn Ser Ala Tyr Arg Cys Thr Glu 
305 310 315 320 
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GAA GAA CGA ACA TGC ATA CGA TTC CAA GAA AAT GGC CCA AAT GTT GAT 52 8 

Glu Glu Arg Thr Cys lie Arg Phe Gin Glu Asn Gly Pro Asn Val Asp 
15 165 170 175 

CGA ATT GAA TTT TAG GAG GGT GGC GGT TGT TCA AGT TTT GTC GGC CGA 57 6 

Arg lie Glu Phe Tyr Asp Gly Gly Gly Cys Ser Ser Phe Val Gly Arg 
180 185 190 

ACA GGA GGC ACG CAA GGA ATT TCA ATT TCA ACA CCA GGA TGT GAT ATT 624 
2 0 Thr Gly Gly Thr Gin Gly lie Ser He Ser Thr Pro Gly Cys Asp He 
195 200 205 

ATT GGT ATT ATA TCA CAT GAA ATT GGT CAT ACT TTA GGA ATA TTT CAT 672 
He Gly He He Ser His Glu He Gly His Thr Leu Gly He Phe His 
210 215 220 

25 GAG CAA GCA CGT CGT GAT CAA AAA AAT CAT ATT TTT ATT AAT TAG AAC 72 0 

Glu Gin Ala Arg Arg Asp Gin Lys Asn His He Phe He Asn Tyr Asn 
225 230 235 240 

AAT ATT CCA TCA AGC CGT TGG AAC AAT TTT TTT CCA TTA TCA GAA TAT 7 68 

Asn He Pro Ser Ser Arg Trp Asn Asn Phe Phe Pro Leu Ser Glu Tyr 
30 245 250 255 

GAA GCT GAT ATG TTT AAT TTA CGT TAT GAT ACA GGA TCA GTA ATG CAC 816 
Glu Ala Asp Met Phe Asn Leu Pro Tyr Asp Thr Gly Ser Val Met His 
260 265 270 

TAT GGT TCA TAG GGA TTT GCA AGA AAT CGG TAT GAA CCA ACT ATT ACA 8 64 

35 Tyr Gly Ser Tyr Gly Phe Ala Arg Asn Pro Tyr Glu Pro Thr He Thr 
275 280 285 

ACA CGT GAT AAA TTT CAA GAG TAG ACA ATT GGG CAA CGT GAA GGG CCA 912 
Thr Arg Asp Lys Phe Gin Gin Tyr Thr He Gly Gin Arg Glu Gly Pro 
290 295 300 
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CAA TGT GCT GA.T ATG CAC TGC GAT CAT AAT GGT TAT CCG GAT OCT AAT 10 08 

Gin Cys Ala Asp Met His Cys Asp His Asn Gly Tyr Pro Asp Pro Asn 
325 330 335 

AAT TGC GCG AAA TGC TTG TGT CCA GAT GGT TTT GCT GGT CGT ACC TGT 1056 
5 Asn Cys Ala Lys Cys Leu Cys Pro Asp Gly Phe Ala Gly Arg Thr Cys 
340 345 350 

CAA TTT GTT CAA TAT ACA TCT TGC GGA GCT CTC ATT AAG GCG AGG AAA 1104 
Gin Phe Val Gin Tyr Thr Ser Cys Gly Ala Leu He Lys Ala Arg Lys 
355 360 365 

10 ATG CCT GTT ACG ATT TCG AGC CCA AAT TAT CCA AAC TTC TTC AAT GTT 1152 
Met Pro Val Thr He Ser Ser Pro Asn Tyr Pro Asn Phe Phe Asn Val 
370 375 380 

GGT GAT CAA TGT ATT TGG TTG CTT ACA GCT CCA CGC GGT GGA TTC GTA 12 00 

Gly Asp Gin Cys lie Trp Leu Leu Thr Ala Pro Arg Gly Gly Phe Val 
15 385 390 395 400 

AAT TTG CAG TTT GTT GAA CAA TTT CAA TTA CAA TGT GAA GAT ACG TGT 1248 
Asn Leu Gin Phe Val Glu Gin Phe Gin Leu Gin Cys Glu Asp Thr Cys 
405 410 415 

GAT AAA TCC TAT GTA GAA GTG AAA GCT GAC GCT GAT TTT CGA CCT ACT 12 9 6 

2 0 Asp Lys Ser Tyr Val Glu Val Lys Ala Asp Ala Asp Phe Arg Pro Thr 

420 425 430 

GGA TAT CGA TTT TGT TGT TCG CGA GTG CCA CGT CAT ATT TTT CAA TCT 1344 
Gly Tyr Arg Phe Cys Cys Ser Arg Val Pro Arg His He Phe Gin Ser 
435 440 445 

25 GCG ACA AAC GAG ATG GTA GTA ATA TTT CGC GGT TTT GGT GAT GCG GGA 1392 
Ala Thr Asn Glu Met Val Val He Phe Arg Gly Phe Gly Asp Ala Gly 
450 455 460 

AAT GGC TTT AAA GCT AAA ATT TGG TCA AAC GTA GAT GAT GAT ATA GCT 1440 
Asn Gly Phe Lys Ala Lys He Trp Ser Asn Val Asp Asp Asp He Ala 
30 465 470 475 480 

AAT ACA ATT GTA ACA ACT GAA ATG GCA AAA ATT TCG GAA AAA ATA CCG 1488 
Asn Thr He Val Thr Thr Glu Met Ala Lys He Ser Glu Lys He Pro 
485 490 495 

AAG CTA ACA GTT CCA ATA GTT AAA ACT ATT ACC ACT CCT ACA ATA ACA 1536 
35 Lys Leu Thr Val Pro He Val Lys Thr He Thr Thr Pro Thr He Thr 
500 505 510 

ACT ACT ACT GCT TTC ATG ATA TCA CCC AAG AAA GGC AAT GTC ACC GCC 1584 
Thr Thr Thr Ala Phe Met He Ser Pro Lys Lys Gly Asn Val Thr Ala 
515 520 525 

40 ACG AGA GTT GCT ATC ACT ACT ACG CCG ACT ACT ACA ATT ACT ACG ACT 1632 
Thr Arg Val Ala He Thr Thr Thr Pro Thr Thr Thr He Thr Thr Thr 
530 535 540 
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ATT GCC GGT ACG GTA CCA ATC ACC GTA ACT AAT AAT ACT ACA CCT GTA 168 0 

lie Ala Gly Thr Val Pro lie Thr Val Thr Asn Asn Thr Thr Pro Val 
545 550 555 560 

GTA AGT GAA ACT TTA CCA TCA TTG CCA GTC AAG ATT CGA AAC AAA ATA 172 8 

5 Val Ser Glu Thr Leu Pro Ser Leu Pro Val Lys lie Arg Asn Lys lie 

565 570 575 

GGT GCA TGC GAA TGT GGT GAA TGG ACA GAA TGG ACA GGT CCA TGC TCT 177 6 

Gly Ala Cys Glu Cys Gly Glu Trp Thr Glu Trp Thr Gly Pro Cys Ser 
580 585 590 

10 CAA GAA TGT GGC GGT TGC GGA AAA CGT CTT CGA ACA CGT CAG TGT TCA 182 4 

Gin Glu Cys Gly Gly Cys Gly Lys Arg Leu Arg Thr Arg Gin Cys Ser 
595 600 605 

TCA GAT ACG GAA TGT AGA ACA GAA GAA AAA CGT GCG TGT GCT TTT AAA 18 72 

Ser Asp Thr Glu Cys Arg Thr Glu Glu Lys Arg Ala Cys Ala Phe Lys 
15 610 615 620 

GTT TGC CCA TAC GGG ACT AAT TTC CTT ATC AAT AAT GGA GAG TTT CAT 192 0 

Val Cys Pro Tyr Gly Thr Asn Phe Leu lie Asn Asn Gly Glu Phe His 
625 630 635 640 

ATA CTT TGG AAG GGC TGC TGT GTT GGT CTA TTC CGA TCG GGA GAT ATG 1968 
2 0 lie Leu Trp Lys Gly Cys Cys Val Gly Leu Phe Arg Ser Gly Asp Met 

645 650 655 

TGT TCA GCA CTT GAT GAT AAC GAG AAT CCA TTT CTG AAA TTT CTA GAA 2 016 

Cys Ser Ala Leu Asp Asp Asn Glu Asn Pro Phe Leu Lys Phe Leu Glu 
660 665 670 

25 TCA CTG TTG AAC ATG CAA GAT TCT CGA AAA AAC GAT AAT TTG CCT GAC 2 064 

Ser Leu Leu Asn Met Gin Asp Ser Arg Lys Asn Asp Asn Leu Pro Asp 
675 680 685 

TCG AAA AAG AAG 2 07 6 

Ser Lys Lys Lys 
30 690 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 

Met Lys Gin Val lie lie Phe Pro Gin Leu Phe lie Cys Phe lie Val 
15 10 15 
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Gin Ser Val Glu Asn Tyr Arg Thr Gin Lys Ala Ser Arg Asn Thr Leu 
20 25 30 

Asp His lie Lys Gin Leu lie Thr Leu Asn Val Gin Arg Glu lie Gly 
35 40 45 

Asn lie Asp Asp Lys Thr Leu Ala Asp Glu lie Val Leu Gin Arg Arg 
50 55 60 

Asp Pro Glu Ala Lys Trp His His Asn Glu Leu Phe lie Asn Asp Pro 
65 70 75 80 

Asp Ala Tyr Tyr Gin Gly Asp Val Asp Leu Ser Glu Lys Gin Ala Glu 
85 90 95 

lie Leu Ser Glu His Phe Lys Asn Glu lie Ala Leu Thr Glu Lys Asp 
100 105 110 

Asp Thr lie lie Arg Arg Lys Lys Ser lie Gly Arg Glu Pro Phe Tyr 
115 120 125 

Val Arg Trp Asn His Lys Arg Pro lie Ser Tyr Glu Phe Ala Glu Ser 
130 135 140 

lie Pro Leu Glu Thr Arg Arg Lys lie Arg Ser Ala lie Ala Met Trp 
145 150 155 160 

Glu Glu Arg Thr Cys lie Arg Phe Gin Glu Asn Gly Pro Asn Val Asp 
165 170 175 

Arg lie Glu Phe Tyr Asp Gly Gly Gly Cys Ser Ser Phe Val Gly Arg 
180 185 190 

Thr Gly Gly Thr Gin Gly lie Ser lie Ser Thr Pro Gly Cys Asp lie 
195 200 205 

He Gly He He Ser His Glu He Gly His Thr Leu Gly He Phe His 
210 215 220 

Glu Gin Ala Arg Arg Asp Gin Lys Asn His He Phe He Asn Tyr Asn 
225 230 235 240 

Asn He Pro Ser Ser Arg Trp Asn Asn Phe Phe Pro Leu Ser Glu Tyr 
245 250 255 

Glu Ala Asp Met Phe Asn Leu Pro Tyr Asp Thr Gly Ser Val Met His 
260 265 270 

Tyr Gly Ser Tyr Gly Phe Ala Arg Asn Pro Tyr Glu Pro Thr He Thr 
275 280 285 

Thr Arg Asp Lys Phe Gin Gin Tyr Thr He Gly Gin Arg Glu Gly Pro 
290 295 300 
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Ser Phe Leu Asp Tyr Ala Ser lie Asn Ser Ala Tyr Arg Cys Thr Glu 
305 310 315 320 

Gin Cys Ala Asp Met His Cys Asp His Asn Gly Tyr Pro Asp Pro Asn 
325 330 335 

5 Asn Cys Ala Lys Cys Leu Cys Pro Asp Gly Phe Ala Gly Arg Thr Cys 
340 345 350 

Gin Phe Val Gin Tyr Thr Ser Cys Gly Ala Leu lie Lys Ala Arg Lys 
355 360 365 

Met Pro Val Thr lie Ser Ser Pro Asn Tyr Pro Asn Phe Phe Asn Val 
10 370 375 380 

Gly Asp Gin Cys lie Trp Leu Leu Thr Ala Pro Arg Gly Gly Phe Val 
385 390 395 400 

Asn Leu Gin Phe Val Glu Gin Phe Gin Leu Gin Cys Glu Asp Thr Cys 
405 410 415 

15 Asp Lys Ser Tyr Val Glu Val Lys Ala Asp Ala Asp Phe Arg Pro Thr 
420 425 430 

Gly Tyr Arg Phe Cys Cys Ser Arg Val Pro Arg His lie Phe Gin Ser 
435 440 445 

Ala Thr Asn Glu Met Val Val lie Phe Arg Gly Phe Gly Asp Ala Gly 
20 450 455 460 

Asn Gly Phe Lys Ala Lys lie Trp Ser Asn Val Asp Asp Asp lie Ala 
465 470 475 480 

Asn Thr lie Val Thr Thr Glu Met Ala Lys lie Ser Glu Lys lie Pro 
485 490 495 

25 Lys Leu Thr Val Pro lie Val Lys Thr lie Thr Thr Pro Thr lie Thr 

500 505 510 

Thr Thr Thr Ala Phe Met lie Ser Pro Lys Lys Gly Asn Val Thr Ala 
515 520 525 

Thr Arg Val Ala lie Thr Thr Thr Pro Thr Thr Thr lie Thr Thr Thr 
30 530 535 540 

lie Ala Gly Thr Val Pro lie Thr Val Thr Asn Asn Thr Thr Pro Val 
545 550 555 560 

Val Ser Glu Thr Leu Pro Ser Leu Pro Val Lys lie Arg Asn Lys lie 
565 570 575 

35 Gly Ala Cys Glu Cys Gly Glu Trp Thr Glu Trp Thr Gly Pro Cys Ser 

580 585 590 



154 



2618-21-1-C1 

Gin Glu Cys Gly Gly Cys Gly Lys Arg Leu Arg Thr Arg Gin Cys Ser 
595 600 605 

Ser Asp Thr Glu Cys Arg Thr Glu Glu Lys Arg Ala Cys Ala Phe Lys 
610 615 620 

5 Val Cys Pro Tyr Gly Thr Asn Phe Leu lie Asn Asn Gly Glu Phe His 
625 630 635 640 

lie Leu Trp Lys Gly Cys Cys Val Gly Leu Phe Arg Ser Gly Asp Met 
645 650 655 

Cys Ser Ala Leu Asp Asp Asn Glu Asn Pro Phe Leu Lys Phe Leu Glu 
10 660 665 670 

Ser Leu Leu Asn Met Gin Asp Ser Arg Lys Asn Asp Asn Leu Pro Asp 
675 680 685 

Ser Lys Lys Lys 
690 

15 (2) INFORMATION FOR SEQ ID NO: 32: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

2 0 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TCAGTCAGTG GAAAATTATC GAACGCAGAA AGCATCACGA AATACGTTAG ATCACATCAA 60 

ACAACTTATC ACCTTGAACG TACAAAGAGA GATTGGAAAC ATAGATGATA AGACATTAGC 120 

25 TGATGAAATA GTATTACAAC GACGGGATCC TGAGGCAAAA TGGCATCATA ATGAACTATT 18 0 

CATT7\ATGAT CCAGATGCAT ACTATCAAGG CGATGTCGAT TTGTCGGAAA AACAAGCCGA 240 

AATTCTAAGC GAACATTTTA AAAATGAAAT TGCTTTAACA GAGAAAGACG ACACAATAAT 300 

ACGGCGAAAA AAGAGCATTG GTCGTGAACC ATTTTACGTA AGATGG7\ATC ATAT^CGTCC 360 

CATTAGCTAT GAATTTGCGG AAAGTATTCC ATTAGAAACA CGTAGAAAAA TTCGTTCAGC 420 

3 0 AATAGCAATG TGGGAAGAAC GAACATGCAT ACGATTCCAA GAAT^TGGCC CAAACGTAGA 48 0 

TCGAATTGTA TTTAACGACG GTGGCGGTTG TTCAAGTTTT GTCGGCCGAA CAGGAGGCAC 54 0 

GCCAGGAATT TCAATTTCAA CACCAGGATG TGATATTATT GGTATTATAT CACATGAAAT 60 0 

TGGTCATACT TTAGGAATAT TTCATGAGCA AGCACGTCGT GATCAAAA7\A ATCATATTTT 660 

TATTAATTAC AACAATATTC CATCAAGCCG TTGGAACAAT TTTTTTCCAT TATCAGAATA 72 0 

155 



2618-21'1~C1 

TGAAGCTGAT ATGTTTAATT TACCTTATGA TACAGGATCA GTAATGCACT ATGGTTCATA 7 80 

CGGATTTGCA AGAAATCCGT ATGAACCAAC TATTACAACA CGTGATAAAT TTCAACAGTA 840 

CACAATTGGG CAACGTGAAG GGCCATCATT TCTGGATTAT GCATCTATAA ACAGCGCTTA 9 00 

TCGTTGTACA GAACAATGTG CTGATATGCA CTGCGATCAT AATGGTTATC CGGATCCTAA 9 60 

5 TAATTGCGCG AAATGCTTGT GTCCAGATGG TTTTGCTGGT CGTACCTGTC AATTTGTTCA 102 0 

ATATACATCT TGCGGAGCTC TCATTAAGGC GAGGAAAATG CCTGTTACGA TTTCGAGCCC 108 0 

AAATTATCCA AACTTCTTCA ATGTTGGTGA TCAATGTATT TGGTTGCTTA CAGCTCCACG 1140 

CGGTGGATTC GTAAATTTGC AGTTTGTTGA ACAATTTCAA TTACAATGTG AAGATACGTG 12 00 

TGATAAATCC TATGTAGAAG TGAAAGCTGA CGCTGATTTT CGACCTACTG GATATCGATT 12 60 

10 TTGTTGTTCG CGAGTGCCAC GTCATATTTT TCAATCTGCG ACAAACGAGA TGGTAGTAAT 132 0 

ATTTCGCGGT TTTGGTGGTG CGGGAAATGG CTTTAAAGCT AAAATTTGGT CAAACGTAGA 1380 

TGATGATATA GCTAATACAA TTGTAACAAC TGAAATGGCA AAAATTTCGG AAAAAATACC 1440 

GAAGCTAACA GTTCCAATAG TTA7\AACTAT TACCACTCCT ACAATAACAA CTACTACTGC 1500 

TTTCATGATA TCACCCAAGA AAGGCAATGT CACCGCCACG AGAGTTGCTA TCACTACTAC 1560 

15 GCCGACTACT ACAATTACTA CGACTATTGC CGGTACGGTA CCAATCACCG TAACTAATAA 162 0 

TACTACCCCT GTAGTAAGTG AAACTTTACC ATCATTGCCA GTCAAGATTC GAAACAAAAT 168 0 

AGGTGCATGC GAATGTGGTG AATGGACAGA ATGGACAGGT CCATGCTCTC AAGAATGTGG 17 40 

CGGTTGCGGA AAACGTCTTC GAACACGTCA GTGTTCATCA GATACGGAAT GTAGAACAGA 1800 

AGAAAAACGT GCGTGTGCTT TTAAAGTTTG CCCATACGGG ACTAATTTCC TTATCAATAA 18 60 

2 0 TGGAGAGTTT CATATACTTT GGAAGGGCTG CTGTGTTGGT CTATTCCGAT CGGGAGATAT 192 0 

GTGTTCAGCA CTTGATGATA ACGAGAATCC ATTTCTGAAA TTTCTAGAAT CACTGTTGAA 19 8 0 

CATGCAAGAT TCTCGAAAAA ACGATAATTT GCCTGACTCG ATW^AGAAGT GA 2032 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2028 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .2028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

5 CAG TCA GTG GAA AAT TAT CGA ACG CAG AAA GCA TCA CGA AAT ACG TTA 4 8 

Gin Ser Val Glu Asn Tyr Arg Thr Gin Lys Ala Ser Arg Asn Thr Leu 
15 10 15 

GAT CAC ATC AAA CAA CTT ATC ACC TTG AAC GTA CAA AGA GAG ATT GGA 96 
Asp His lie Lys Gin Leu lie Thr Leu Asn Val Gin Arg Glu lie Gly 
10 20 25 30 

AAC ATA GAT GAT AAG ACA TTA GCT GAT GAA ATA GTA TTA CAA CGA CGG 144 
Asn lie Asp Asp Lys Thr Leu Ala Asp Glu lie Val Leu Gin Arg Arg 
35 40 45 

GAT CCT GAG GCA AAA TGG CAT CAT AAT GAA CTA TTC ATT AAT GAT CCA 192 
15 Asp Pro Glu Ala Lys Trp His His Asn Glu Leu Phe lie Asn Asp Pro 
50 55 60 

GAT GCA TAC TAT CAA GGC GAT GTC GAT TTG TCG GAA AAA CAA GCC GAA 2 40 

Asp Ala Tyr Tyr Gin Gly Asp Val Asp Leu Ser Glu Lys Gin Ala Glu 
65 70 75 80 

2 0 ATT CTA AGC GAA CAT TTT AAA AAT GAA ATT GCT TTA ACA GAG AAA GAC 2 88 

lie Leu Ser Glu His Phe Lys Asn Glu lie Ala Leu Thr Glu Lys Asp 
85 90 95 

GAC ACA ATA ATA CGG CGA AAA AAG AGC ATT GGT CGT GAA CCA TTT TAC 336 
Asp Thr lie lie Arg Arg Lys Lys Ser lie Gly Arg Glu Pro Phe Tyr 
25 100 105 110 

GTA AGA TGG AAT CAT AAA CGT CCC ATT AGC TAT GAA TTT GCG GAA AGT 384 
Val Arg Trp Asn His Lys Arg Pro lie Ser Tyr Glu Phe Ala Glu Ser 
115 120 125 

ATT CCA TTA GAA ACA CGT AGA AAA ATT CGT TCA GCA ATA GCA ATG TGG 432 

3 0 lie Pro Leu Glu Thr Arg Arg Lys lie Arg Ser Ala lie Ala Met Trp 

130 135 140 

GAA GAA CGA ACA TGC ATA CGA TTC CAA GAA AAT GGC CCA AAC GTA GAT 48 0 

Glu Glu Arg Thr Cys lie Arg Phe Gin Glu Asn Gly Pro Asn Val Asp 
145 150 155 160 

35 CGA ATT GTA TTT AAC GAC GGT GGC GGT TGT TCA AGT TTT GTC GGC CGA 52 8 

Arg lie Val Phe Asn Asp Gly Gly Gly Cys Ser Ser Phe Val Gly Arg 
165 170 175 

ACA GGA GGC ACG CCA GGA ATT TCA ATT TCA ACA CCA GGA TGT GAT ATT 57 6 

Thr Gly Gly Thr Pro Gly lie Ser lie Ser Thr Pro Gly Cys Asp lie 
40 180 185 190 
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ATT GGT ATT ATA TCA CAT GAA ATT GGT CAT ACT TTA GGA ATA TTT CAT 624 
lie Gly lie lie Ser His Glu lie Gly His Thr Leu Gly lie Phe His 
195 200 205 

GAG CAA GCA CGT CGT GAT CAA AAA AAT CAT ATT TTT ATT AAT TAG AAC 672 
5 Glu Gin Ala Arg Arg Asp Gin Lys Asn His lie Phe lie Asn Tyr Asn 
210 215 220 

AAT ATT CCA TCA AGC CGT TGG AAC AAT TTT TTT CCA TTA TCA GAA TAT 72 0 

Asn lie Pro Ser Ser Arg Trp Asn Asn Phe Phe Pro Leu Ser Glu Tyr 
225 230 235 240 

10 GAA GCT GAT ATG TTT AAT TTA CCT TAT GAT ACA GGA TCA GTA ATG CAC 7 68 

Glu Ala Asp Met Phe Asn Leu Pro Tyr Asp Thr Gly Ser Val Met His 
245 250 255 

TAT GGT TCA TAC GGA TTT GCA AGA AAT CCG TAT GAA CCA ACT ATT ACA 816 
Tyr Gly Ser Tyr Gly Phe Ala Arg Asn Pro Tyr Glu Pro Thr lie Thr 
15 260 265 270 

ACA CGT GAT AAA TTT CAA CAG TAC ACA ATT GGG CAA CGT GAA GGG CCA 8 64 

Thr Arg Asp Lys Phe Gin Gin Tyr Thr lie Gly Gin Arg Glu Gly Pro 
275 280 285 

TCA TTT CTG GAT TAT GCA TCT ATA AAC AGC GCT TAT CGT TGT ACA GAA 912 
2 0 Ser Phe Leu Asp Tyr Ala Ser lie Asn Ser Ala Tyr Arg Cys Thr Glu 
290 295 300 

CAA TGT GCT GAT ATG CAC TGC GAT CAT AAT GGT TAT CCG GAT CCT AAT 9 60 

Gin Cys Ala Asp Met His Cys Asp His Asn Gly Tyr Pro Asp Pro Asn 
305 310 315 320 

25 AAT TGC GCG AAA TGC TTG TGT CCA GAT GGT TTT GCT GGT CGT ACC TGT 10 0 8 

Asn Cys Ala Lys Cys Leu Cys Pro Asp Gly Phe Ala Gly Arg Thr Cys 
325 330 335 

CAA TTT GTT CAA TAT ACA TCT TGC GGA GCT CTC ATT AAG GCG AGG AAA 1056 
Gin Phe Val Gin Tyr Thr Ser Cys Gly Ala Leu lie Lys Ala Arg Lys 
30 340 345 350 

ATG CCT GTT ACG ATT TCG AGC CCA AAT TAT CCA AAC TTC TTC AAT GTT 1104 
Met Pro Val Thr lie Ser Ser Pro Asn Tyr Pro Asn Phe Phe Asn Val 
355 360 365 

GGT GAT CAA TGT ATT TGG TTG CTT ACA GCT CCA CGC GGT GGA TTC GTA 1152 
35 Gly Asp Gin Cys lie Trp Leu Leu Thr Ala Pro Arg Gly Gly Phe Val 
370 375 380 

AAT TTG CAG TTT GTT GAA CAA TTT CAA TTA CAA TGT GAA GAT ACG TGT 12 00 

Asn Leu Gin Phe Val Glu Gin Phe Gin Leu Gin Cys Glu Asp Thr Cys 
385 390 395 400 

4 0 GAT AAA TCC TAT GTA GAA GTG AAA GCT GAC GCT GAT TTT CGA CCT ACT 1248 
Asp Lys Ser Tyr Val Glu Val Lys Ala Asp Ala Asp Phe Arg Pro Thr 
405 410 415 
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GGA TAT CGA. TTT TGT TGT TCG CGA GTG CCA CGT CAT ATT TTT CAA TCT 12 96 

Gly Tyr Arg Phe Cys Cys Ser Arg Val Pro Arg His lie Phe Gin Ser 
420 425 430 

GCG ACA AAC GAG ATG GTA GTA ATA TTT CGC GGT TTT GGT GGT GCG GGA 1344 
5 Ala Thr Asn Glu Met Val Val lie Phe Arg Gly Phe Gly Gly Ala Gly 
435 440 445 

AAT GGC TTT AAA GCT AAA ATT TGG TCA AAC GTA GAT GAT GAT ATA GCT 13 92 

Asn Gly Phe Lys Ala Lys lie Trp Ser Asn Val Asp Asp Asp lie Ala 
450 455 460 

10 AAT ACA ATT GTA ACA ACT GAA ATG GCA AAA ATT TCG GAA AAA ATA CCG 1440 
Asn Thr lie Val Thr Thr Glu Met Ala Lys lie Ser Glu Lys lie Pro 
465 470 475 480 

AAG CTA ACA GTT CCA ATA GTT AAA ACT ATT ACC ACT CCT ACA ATA ACA 148 8 

Lys Leu Thr Val Pro lie Val Lys Thr lie Thr Thr Pro Thr lie Thr 
15 485 490 495 

ACT ACT ACT GCT TTC ATG ATA TCA CCC AAG AAA GGC AAT GTC ACC GCC 1536 
Thr Thr Thr Ala Phe Met lie Ser Pro Lys Lys Gly Asn Val Thr Ala 
500 505 510 

ACG AGA GTT GCT ATC ACT ACT ACG CCG ACT ACT ACA ATT ACT ACG ACT 1584 
2 0 Thr Arg Val Ala lie Thr Thr Thr Pro Thr Thr Thr lie Thr Thr Thr 
515 520 525 

ATT GCC GGT ACG GTA CCA ATC ACC GTA ACT AAT AAT ACT ACC CCT GTA 1632 
lie Ala Gly Thr Val Pro lie Thr Val Thr Asn Asn Thr Thr Pro Val 
530 535 540 

2 5 GTA AGT GAA ACT TTA CCA TCA TTG CCA GTC AAG ATT CGA AAC AAA ATA 168 0 

Val Ser Glu Thr Leu Pro Ser Leu Pro Val Lys lie Arg Asn Lys lie 
545 550 555 560 

GGT GCA TGC GAA TGT GGT GAA TGG ACA GAA TGG ACA GGT CCA TGC TCT 172 8 

Gly Ala Cys Glu Cys Gly Glu Trp Thr Glu Trp Thr Gly Pro Cys Ser 
30 565 570 575 

CAA GAA TGT GGC GGT TGC GGA AAA CGT CTT CGA ACA CGT CAG TGT TCA 177 6 

Gin Glu Cys Gly Gly Cys Gly Lys Arg Leu Arg Thr Arg Gin Cys Ser 
580 585 590 

TCA GAT ACG GAA TGT AGA ACA GAA GAA AAA CGT GCG TGT GCT TTT AAA 1824 
35 Ser Asp Thr Glu Cys Arg Thr Glu Glu Lys Arg Ala Cys Ala Phe Lys 
595 600 605 

GTT TGC CCA TAC GGG ACT AAT TTC CTT ATC AAT AAT GGA GAG TTT CAT 18 72 

Val Cys Pro Tyr Gly Thr Asn Phe Leu lie Asn Asn Gly Glu Phe His 
610 615 620 

4 0 ATA CTT TGG AAG GGC TGC TGT GTT GGT CTA TTC CGA TCG GGA GAT ATG 192 0 

lie Leu Trp Lys Gly Cys Cys Val Gly Leu Phe Arg Ser Gly Asp Met 
625 630 635 640 
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TGT TCA GCA CTT GAT GAT AAC GAG AAT CCA TTT CTG AAA TTT CTA GAA 19 68 

Cys Ser Ala Leu Asp Asp Asn Glu Asn Pro Phe Leu Lys Phe Leu Glu 
645 650 655 

TCA CTG TTG AAC ATG CAA GAT TCT CGA AAA AAC GAT AAT TTG CCT GAC 2016 
5 Ser Leu Leu Asn Met Gin Asp Ser Arg Lys Asn Asp Asn Leu Pro Asp 
660 665 670 

TCG AAA AAG AAG 2028 
Ser Lys Lys Lys 
675 

10 (2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Gin Ser Val Glu Asn Tyr Arg Thr Gin Lys Ala Ser Arg Asn Thr Leu 
15 10 15 

Asp His lie Lys Gin Leu lie Thr Leu Asn Val Gin Arg Glu lie Gly 
20 20 25 30 

Asn lie Asp Asp Lys Thr Leu Ala Asp Glu lie Val Leu Gin Arg Arg 
35 40 45 

Asp Pro Glu Ala Lys Trp His His Asn Glu Leu Phe lie Asn Asp Pro 
50 55 60 

25 Asp Ala Tyr Tyr Gin Gly Asp Val Asp Leu Ser Glu Lys Gin Ala Glu 
65 70 75 80 

lie Leu Ser Glu His Phe Lys Asn Glu lie Ala Leu Thr Glu Lys Asp 
85 90 95 

Asp Thr lie lie Arg Arg Lys Lys Ser lie Gly Arg Glu Pro Phe Tyr 
30 100 105 110 

Val Arg Trp Asn His Lys Arg Pro lie Ser Tyr Glu Phe Ala Glu Ser 
115 120 125 

lie Pro Leu Glu Thr Arg Arg Lys lie Arg Ser Ala lie Ala Met Trp 
130 135 140 

35 Glu Glu Arg Thr Cys lie Arg Phe Gin Glu Asn Gly Pro Asn Val Asp 
145 150 155 160 
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Arg lie Val Phe Asn Asp Gly Gly Gly Cys Ser Ser Phe Val Gly Arg 
165 170 175 

Thr Gly Gly Thr Pro Gly lie Ser lie Ser Thr Pro Gly Cys Asp lie 
180 185 190 

5 lie Gly lie lie Ser His Glu He Gly His Thr Leu Gly He Phe His 
195 200 205 

Glu Gin Ala Arg Arg Asp Gin Lys Asn His He Phe He Asn Tyr Asn 
210 215 220 

Asn He Pro Ser Ser Arg Trp Asn Asn Phe Phe Pro Leu Ser Glu Tyr 
10 225 230 235 240 

Glu Ala Asp Met Phe Asn Leu Pro Tyr Asp Thr Gly Ser Val Met His 
245 250 255 

Tyr Gly Ser Tyr Gly Phe Ala Arg Asn Pro Tyr Glu Pro Thr He Thr 
260 265 270 

15 Thr Arg Asp Lys Phe Gin Gin Tyr Thr He Gly Gin Arg Glu Gly Pro 
275 280 285 

Ser Phe Leu Asp Tyr Ala Ser He Asn Ser Ala Tyr Arg Cys Thr Glu 
290 295 300 

Gin Cys Ala Asp Met His Cys Asp His Asn Gly Tyr Pro Asp Pro Asn 
20 305 310 315 320 

Asn Cys Ala Lys Cys Leu Cys Pro Asp Gly Phe Ala Gly Arg Thr Cys 
325 330 335 

Gin Phe Val Gin Tyr Thr Ser Cys Gly Ala Leu He Lys Ala Arg Lys 
340 345 350 

25 Met Pro Val Thr He Ser Ser Pro Asn Tyr Pro Asn Phe Phe Asn Val 
355 360 365 

Gly Asp Gin Cys He Trp Leu Leu Thr Ala Pro Arg Gly Gly Phe Val 
370 375 380 

Asn Leu Gin Phe Val Glu Gin Phe Gin Leu Gin Cys Glu Asp Thr Cys 
30 385 390 395 400 

Asp Lys Ser Tyr Val Glu Val Lys Ala Asp Ala Asp Phe Arg Pro Thr 
405 410 415 

Gly Tyr Arg Phe Cys Cys Ser Arg Val Pro Arg His He Phe Gin Ser 
420 425 430 

35 Ala Thr Asn Glu Met Val Val He Phe Arg Gly Phe Gly Gly Ala Gly 
435 440 445 
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Asn Gly Phe Lys Ala Lys lie Trp Ser Asn Val Asp Asp Asp lie Ala 
450 455 460 

Asn Thr lie Val Thr Thr Glu Met Ala Lys lie Ser Glu Lys lie Pro 
465 470 475 480 

5 Lys Leu Thr Val Pro lie Val Lys Thr lie Thr Thr Pro Thr lie Thr 

485 490 495 

Thr Thr Thr Ala Phe Met lie Ser Pro Lys Lys Gly Asn Val Thr Ala 
500 505 510 

Thr Arg Val Ala lie Thr Thr Thr Pro Thr Thr Thr lie Thr Thr Thr 
10 515 520 525 

lie Ala Gly Thr Val Pro lie Thr Val Thr Asn Asn Thr Thr Pro Val 
530 535 540 

Val Ser Glu Thr Leu Pro Ser Leu Pro Val Lys lie Arg Asn Lys lie 
545 550 555 560 

15 Gly Ala Cys Glu Cys Gly Glu Trp Thr Glu Trp Thr Gly Pro Cys Ser 

565 570 575 

Gin Glu Cys Gly Gly Cys Gly Lys Arg Leu Arg Thr Arg Gin Cys Ser 
580 585 590 

Ser Asp Thr Glu Cys Arg Thr Glu Glu Lys Arg Ala Cys Ala Phe Lys 
20 595 600 605 

Val Cys Pro Tyr Gly Thr Asn Phe Leu lie Asn Asn Gly Glu Phe His 
610 615 620 

lie Leu Trp Lys Gly Cys Cys Val Gly Leu Phe Arg Ser Gly Asp Met 
625 630 635 640 

25 Cys Ser Ala Leu Asp Asp Asn Glu Asn Pro Phe Leu Lys Phe Leu Glu 

645 650 655 

Ser Leu Leu Asn Met Gin Asp Ser Arg Lys Asn Asp Asn Leu Pro Asp 
660 665 670 

Ser Lys Lys Lys 
30 675 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
35 ( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CATCTCGAGA TCAGTGGAAA ATTATCGAAC G 31 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ATTGAATTCA CTTCTTTTTC GAGTCAGGCA A 31 

While various embodiments of the present invention have 
been described in detail, it is apparent that modifications 
and adaptations of those embodiments will occur to those 
15 skilled in the art. It is to be expressly understood, 
however, that such modifications and adaptations are within 
the scope of the present invention, as set forth in the 
following claims: 
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What is claimed is : 

1. An isolated protein comprising a parasite astacin 
metalloendopeptidase protein. 

2. The protein of Claim 1^ wherein said protein^, when 
administered to an animal in arT effective manner;, elicits an 
immune response against a parasite astacin 
metalloendopeptidase . 

3. The protein of Claim ^i, wherein said parasite is 

y 

selected from the group consisting of parasitic helminths^ 
protozoan parasites and ectoparasites, 

4. The protein of Claim 1, wherein said parasite 
comprises a tissue-migrating helminth. 

5. The protein of Claim 1, wherein said parasite is 
selected from the group consisting of nematodes, cestodes and 
trematodes . 

6. The protein of Claim 1, wherein said parasite 
comprises a nematode selected from the group consisting of 
filariid, ascarid, strongyle and trichostrongyle nematodes. 

7. The protein of Claim 1, wherein said parasite 
comprises a filariid nematode .-'^selected from the group 
consisting of Dlrofilaria, Acanthocheilonema, Bruqia, 
Dipetalonemaf Loa, Onchocerca, Parafilaria, Setaria, 

5 Stephanofilaria and Wuchereria filariid nematodes. 
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8. The protein of Claim 1, wherein said parasite 
comprises D, immi tis. ^.-^^ 

9. The protein of Claim 1, wherein said protein is 
encoded by a parasite nucleic acid" molecule which hybridizes 
under stringent conditions with a D. immitis astacin 
metalloendopeptidase gene • 

10. The protein of Claim 1, wherein said protein 
comprises an amino acid sequence"^ having at least about 4 0 
percent homology with an amino acid sequence selected from the 
group consisting of SEQ ID NO: 31 and SEQ ID NO: 34. 

11. The protein of Claim 1, wherein said protein 
comprises at least a portion of at least one amino acid 
sequence selected from the group consisting of SEQ ID NO: 3, 
SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO: 6, SEQ ID N0:7, SEQ ID 

5 NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 31 and SEQ ID 

NO: 34, wherein said portion is encoded by a nucleic acid 
molecule which hybridizes under stringent conditions with a 
nucleic acid molecule selected from the group consisting of 
nDiMPAli299/ nDiMPA22i26/ L3 nDlMPAS 

2292, nDiMPA32076/ adult 

10 nDiMPA32032. and adult nDiMPA32028 - 

12. The protein of Claim 1, . wherein said protein 
comprises an extended zinc-binding^ domain motif. 
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13. The protein of Claim 1, wherein said protein is 
produced by a process comprising^ culturing in an effective 
medium a recombinant cell transformed with a nucleic acid 
molecule encoding said protein to produce said protein* 

14. The protein of Claim 1,^ wherein said protein is used 
to identify an inhibitor of^ astacin metalloendopeptidase 
activity. 

15. An isolated antibody capable of selectively binding 
to a protein set forth in Claim 1. 



166 



2618-21~1-C1 

15, A therapeutic composition for protecting an animal 
from .-'disease caused by a parasite, said parasite being 
susceptible to an inhibitor of an astacin 
metalloendopeptidase, said therapeutic composition comprising 
at least one protective compound selected from the group 
consisting of: an isolated parasite astacin 

metal loendopeptidase protein; an anti-parasite astacin 
metalloendopeptidase antibody; and an inhibitor of astacin 
metalloendopeptidase activity identified by its ability to 
inhibit parasite astacin metalloendopeptidase activity of said 
protein. 

17, The composition of Claim 16, wherein said 
composition further comprises at least one component selected 
from the group consisting of an excipient, an adjuvant and a 
carrier . 

18, The composition of Claim 16, wherein said parasite 
comprises a tissue-migrating helminth. 

19, The composition of Claim 16, wherein said disease 
comprises heartworm infection. 
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20. A method to protect an animal from disease caused by 
a parasite, said parasite being susceptible to an inhibitor of 
an astacin metalloendopeptidase, said method comprising 
administering to said animal a therapeutic composition 
comprising at least one protective compound selected from the 
group consisting of: an isolated parasite astacin 
metalloendopeptidase protein; an anti-parasite astacin 
metalloendopeptidase antibody; and an inhibitor of astacin 
metalloendopeptidase activity identified by its ability to 
inhibit parasite astacin metalloendopeptidase activity of said 
protein. 

21. The method of Claim 20, wherein said parasite 
comprises a tissue-migrating helfninth. 

22. The method of Claim 20, wherein said disease 

comprises heartworm infection. -"^ 
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22., -K method to identify a compound capable of 
inhi-lDiting astacin metalloendopeptidase activity of a 
parasite, said method comprising: 

(a) contacting an isolated parasite astacin 
5 metalloendopeptidase protein with a putative inhibitory 

compound under conditions in which, in the absence of 
said compound, said astacin metalloendopeptidase protein 
has astacin metalloendopeptidase activity; and 

(b) determining if said putative inhibitory compound 
10 inhibits said activity. 
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24. A test kit to identify a compound capable of 
inhibiting astacin metalloendopeptidase activity of a 
parasite, said test kit comprising an isolated parasite 
astacin metalloendopeptidase protein having astacin 
metalloendopeptidase activity and a means for determining the 
extent of inhibition of said activity in the presence of a 
putative inhibitory compound. 
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Abstract 

The present invention relates to parasite astacin 
metalloendopeptidase proteins, nucleic acid molecules having 
sequences that encode such proteins, antibodies raised against 
5 such proteins and compounds that can inhibit the activities of 
parasite astacin metalloendopeptidases . The present invention 
also includes methods to obtain such nucleic acid molecules, 
proteins, antibodies and inhibitors. The present invention 
also includes therapeutic compositions comprising such nucleic 
10 acid molecules, proteins, antibodies and inhibitors as well as 
their use to protect animals from disease caused by parasites, 
such as heartworm infection. 
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